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Description 

Technical Field 

[0001] The present invention relates to a method and 
apparatus for seamlessly reproducing a bitstream hav- 
ing non-sequential system clock data therein and, more 
specifically, to a bitstream for use in an authoring system 
for variously processing a data bitstream comprising the 
video data, audio data, and sub-picture data constituting 
each of plural program titles containing related video da- 
ta, audio data, and sub-picture data content to generate 
a bitstream from which a new title containing the content 
desired by the user can be reproduced, and efficiently 
recording and reproducing said generated bitstream us- 
ing a particular recording medium. 

Background Art 

[0002] Authoring systems used to produce program 
titles comprising related video data, audio data, and 
sub-picture data by digitally processing, for example, 
multimedia data comprising video, audio, and sub-pic- 
ture data recorded to laser disk or video CD formats are 
currently available. Systems using Video-Cds in partic- 
ular are able to record video data to a CD format disk, 
which was originally designed with an approximately 
600 MB recording capacity for storing digital audio data 
only, by using such high efficiency video compression 
techniques as MPEG. As a result of the increased ef- 
fective recording capacity achieved using data com- 
pression techniques, karaoke titles and other conven- 
tional laser disk applications are gradually being trans- 
ferred to the video CD format. 

[0003] Users today expect both sophisticated title 
content and high reproduction quality. To meet these ex- 
pectations, each title must be composed from bit- 
streams with an increasingly deep hierarchical struc- 
ture. The data size of multimedia titles written with bit- 
streams having such deep hierarchical structures, how- 
ever, is ten or more times greater than the data size of 
less complex titles. The need to edit small image (title) 
details also makes it necessary to process and control 
the bitstream using low order hierarchical data units. 
[0004] It is therefore necessary to develop and prove 
a bitstream structure and an advanced digital process- 
ing method including both recording and reproduction 
capabilities whereby a large volume, multiple level hier- 
archical digital bitstream can be efficiently controlled at 
each level of the hierarchy. Also needed are an appara- 
tus for executing this digital processing method, and a 
recording media to which the bitstream digitally proc- 
essed by said apparatus can be efficiently recorded for 
storage and from which said recorded information can 
be quickly reproduced. 

[0005] Means of increasing the storage capacity of 
conventional optical disks have been widely researched 
to address the recording medium aspect of this problem. 



One way to increase the storage capacity of the optical 
disk is to reduce the spot diameter D of the optical (laser) 
beam. If the wavelength of the laser beam is I and the 
aperture of the objective lens is NA, then the spot diam- 
5 eter D is proportional to l/NA, and the storage capacity 
can be efficiently improved by decreasing I and increas- 
ing NA. 

[0006] As described, for example, in United States 
Patent 5,235,581 , however, coma caused by a relative 

10 tilt between the disk surface and the optical axis of the 
laser beam (hereafter "tilt") increases when a large ap- 
erture (high NA) lens is used. To prevent tilt-induced co- 
ma, the transparent substrate must be made very thin. 
The problem is that the mechanical strength of the disk 

15 }s low when the transparent substrate is very thin. 
[0007] MPEG1 , the conventional method of recording 
and reproducing video, audio, and graphic signal data, 
has also been replaced by the more robust MPEG2 
method, which can transfer large data volumes at a 

20 higher rate. It should be noted that the compression 
method and data format of the MPEG2 standard differ 
somewhat from those of MPEG1 . The specific content 
of and differences between MPEG1 and MPEG2 are de- 
scribed in detail in the ISO-1 1 1 72 and ISO-1 381 8 MPEG 

25 standards, and further description thereof is omitted be- 
low. 

[0008] Note, however, that while the structure of the 
encoded video stream is defined in the MPEG2 specifi- 
cation, the hierarchical structure of the system stream 
30 and the method of processing lower hierarchical levels 
are not defined. 

[0009] As described above, it is therefore not possible 
in a conventional authoring system to process a large 
data stream containing sufficient information to satisfy 

35 many different user requirements. Moreover, even if 
such a processing method were available, the proc- 
essed data recorded thereto cannot be repeatedly used 
to reduce data redundancy because there is no large 
capacity recording medium currently available that can 

40 efficiently record and reproduce high volume bitstreams 
such as described above. 

[0010] More specifically, particular significant hard- 
ware and software requirements must be satisfied in or- 
derto process a bitstream using a data unit smallerthan 

45 the title. These specific hardware requirements include 
significantly increasing the storage capacity of the re- 
cording medium and increasing the speed of digital 
processing; software requirements include inventing an 
advanced digital processing method including a sophis- 

50 ticated data structure. 

[0011] Therefore, the object of the present invention 
is to provide an effective authoring system for controlling 
a multimedia data bitstream with advanced hardware 
and software requirements using a data unit smaller 

55 than the title to better address advanced user require- 
ments. 

[001 2] To share data between plural titles and thereby 
efficiently utilize optical disk capacity, multi-scene con- 
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trol whereby scene data common to plural titles and the 
desired scenes on the same time-base from within multi- 
scene periods containing plural scenes unique to par- 
ticular reproduction paths can be freely selected and re- 
produced is desirable. 

[0013] However, when plural scenes unique to a re- 
production path within the multi-scene period are ar- 
ranged on the same time-base, the scene data must be 
contiguous. Unselected multi-scene data is therefore 
unavoidably inserted between the selected common 
scene data and the selected multi-scene data. The prob- 
lem this creates when reproducing multi-scene data is 
that reproduction is interrupted by this unselected scene 
data. 

[0014] In other words, except when a VOB, which is 
normally a single-stream title editing unit, is divided into 
discrete streams, seamless reproduction cannot be 
achieved by simply connecting and reproducing individ- 
ual VOB. This is because while the reproduction of vid- 
eo, audio, and sub-picture streams forming each VOB 
must be synchronized, the means for achieving this syn- 
chronization is enclosed in each VOB. As a result, the 
synchronization means will not function normally at VOB 
connections if the VOB are simply connected together. 
[0015] The international patent application WO - A - 
95/12179 discloses a data stream which contains two 
alternatively reproducible video presentations. The data 
stream which is disclosed in this documents contains 
three types of data blocks. Two types of data blocks be- 
long to different versions of video presentations where- 
as a third type of data block is common to both versions. 
If a particular video version shall be replayed the non- 
selected video data units have to be jumped over in or- 
der to go to the next block. 

[001 6] The object of the present invention is therefore 
to provide a seamless ,or in some cases, non-seamless 
reproduction. 

[0017] This objective is achieved according to the 
present invention by a machine readable optical disc- 
containing a title having a first data stream and a second 
data stream, said disc comprising: said first data stream 
containing video data; said second data stream contain- 
ing video data; and a management information storage 
area comprising management information, wherein the 
management information contains a flag operable to in- 
dicate, when said optical disc is read by a machine, 
whether or not the second data stream shall be - during 
reproduction - presented seamlessly after presentation 
of the first data stream, such that the presentation ter- 
minating time of the video data of the first data stream 
is equivalent to the presentation starting time of the vid- 
eo data of the second data stream. 
[0018] According to the present invention, this objec- 
tive is also achieved by a reproduction apparatus for re- 
producing an optical disc containing a title) having a first 
data stream and a second data stream, said disc con- 
taining said first data stream containing video data, said 
second data stream containing video data, and a man- 



agement information storage area comprising manage- 
ment information, wherein the management information 
contains a flag operable to indicate whether or not the 
second data stream shall be presented seamlessly after 

5 presentation of the first data stream, said reproduction 
apparatus comprising: a reading arrangement operable 
to read the management information and the first and 
second data streams from the optical disc; and a deter- 
mining arrangement operable to determine, based on 

10 the flag, whether or not to present the second data 
stream seamlessly after presentation of the first data 
stream such that the presentation terminating time of the 
video data of the first data stream is equivalent to the 
presentation starting time of the video data of the sec- 

15 ond data stream. 

[001 9] Further, according to the present invention, the 
object is also achieved by a reproduction method for re- 
producing an optical disc containing a title having a first 
data stream and a second data stream, said disc con- 

20 taining said first data stream containing video data, said 
second, data stream containing video data, and a man- 
agement information storage area comprising manage- 
ment information, wherein the management information 
contains a flag operable to indicate whether or not the 

25 second data stream shall be presented seamlessly after 
presentation of the first data stream, said reproduction 
method comprising: reading the management informa- 
tion and the first andsecond data streams from the op- 
tical disc; and determining, based on the flag, whether 

30 or not to present the second data stream seamlessly af- 
ter presentation of the first data stream such that the 
presentation terminating time of the video data of the 
first data stream is equivalent to the presentation start- 
ing time of the video data of the second data stream. 

35 

Brief Description of Drawings 
[0020] 

40 Fig. 1 is a graph schematically showing a structure 
of multi media bit stream according to the present 
invention, 

Fig. 2 is a block diagram showing an authoring en- 
coder according to the present invention, 

45 Fig. 3 is a block diagram showing an authoring de- 
coder according to the present invention, 
Fig. 4 is a side view of an optical disk storing the 
multi media bit stream of Fig. 1 , 
Fig. 5 is an enlarged view showing a portion con- 

50 fined by a circle of Fig. 4, 

Fig. 6 is an enlarged view showing a portion con- 
fined by a circle of Fig. 5, 

Fig. 7 is a side view showing a variation of the op- 
tical disk of Fig. 4, 
55 Fig. 8 is a side view showing another variation of 
the optical disk of Fig. 4, 

Fig. 9 is a plan view showing one example of track 
path formed on the recording surface of the optical 
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disk of Fig. 4, 

Fig. 10 is a plan view showing another example of 
track path formed on the recording surface of the 
optical disk of Fig. 4, 

Fig. 11 is a diagonal view schematically showing 5 
one example of a track path pattern formed on the 
optical disk of Fig. 7, 

Fig. 12 is a plan view showing another example of 
track path formed on the recording surface of the 
optical disk of Fig. 7, 10 
Fig. 13 is a diagonal view schematically showing 
one example of a track path pattern formed on the 
optical disk of Fig. 8, 

Fig. 14 is a plan view showing another example of 
track path formed on the recording surface of the « 
optical disk of Fig. 8, 

Fig. 15 is a graph in assistance of explaining a con- 
cept of parental control according to the present in- 
vention, 

Fig. 1 6 is a graph schematically showing the struc- 20 
ture of multimedia bit stream for use in Digital Video 
Disk system according to the present invention, 
Fig. 1 7 is a graph schematically showing the encod- 
ed video stream according to the present invention, 
Fig. 1 8 is a graph schematically showing an internal 25 
structure of a video zone of Fig. 1 6. 
Fig. 1 9 is a graph schematically showing the stream 
management information according to the present 
invention, 

Fig. 20 is a graph schematically showing the struc- 30 
ture the navigation pack NV of Fig. 17, 
Fig. 21 is a graph is assistance of explaining a con- 
cept of parental lock playback control according to 
the present invention, 

Fig. 22 is a graph schematically showing the data 35 
structure used in a digital video disk system accord- 
ing to the present invention, 
Fig. 23 is a graph in assistance of explaining a con- 
cept of Multi-angle scene control according to the 
present invention, 40 
Fig. 24 is a graph in assistance of explaining a con- 
cept of multi scene data connection, 
Fig. 25 is a block diagram showing a DVD encoder 
according to the present invention, 
Fig. 26 is a block diagram showing a DVD decoder 45 
according to the present invention, 
Fig. 27 is a graph schematically showing an encod- 
ing information table generated by the encoding 
system controller of Fig. 25, 

Fig. 28 is a graph schematically showing an encod- 50 
ing information tables, 

Fig. 29 is a graph schematically showing an encod- 
ing parameters used by the video encoder of Fig. 
25, 

Fig. 30 is a graph schematically showing an exam- 55 
pie of the concents of the program chain information 
according to the present invention, 
Fig. 31 is a graph schematically showing another 



example of the contents of the program chain infor- 
mation according to the present invention, 
Fig. 32 is a block diagram showing a synchronizer 
of Fig. 26 according to the present invention, 
Fig. 33 is a graph in assistance of explaining a con- 
cept of multi-angle scene control according to the 
present in invention, 

Fig. 34 is a flow chart, formed by Figs. 34A and 34B, 
showing an operation of the DVD encoder of Fig. 25 
, Fig. 35 is a flow chart showing detailed of the en- 
code parameter production sub-routine of Fig. 34, 
Fig. 36 is a flow chart showing the detailed of the 
VOB data setting routine of Fig. 35, 
Fig. 37 is a.flow chart showing the encode param- 
eters generating operation for a seamless switch- 
ing, 

Fig. 38 is a flow chart showing the encode param- 
eters generating operation for a system stream, 
Fig. 39 is a block diagram showing the STC gener- 
ator of Fig. 32, 

Fig. 40 is a graph in assistance of explaining the 
realtionship the relationship between the SCR, 
APTS, VDTS, and VPTS values, 
Fig. 41 is a block diagram showing a modification 
of the synchronizer of Fig. 32, 
Fig. 42 is a block diagram showing a synchroniza- 
tion controller of Fig. 41 , 

Fig. 43 is a flow chart showing an operation of the 
syncronization controller of Fig. 42, 
Fig. 44 is a graph in assistance of explaining the 
relationship between the system clock reference 
SCR, the audio playback starting time information 
APTS, the decoder reference clock STC, and the 
video playback starting time VPTS, 
Fig. 45 is a graph in assistance of explaining the 
relationship between the recording positions and 
values of SCR, APTS, and VPTS when VOB #1 and 
VOB #2 are seamlessly reproduced, 
Fig. 46 is a graph in assitance of explaining the re- 
lationship between the SCR, APTS, and VPTS val- 
ues and recording positions in each VOB, 
Fig. 47 is a graph in assistance of explaining the 
relationship between the SCR, APTS, and VPTS 
values and recording positions in the VOB, 
Fig. 48 is a graph showing a time line from input of 
the VOB in Fig. 47 to the system decoder to output 
of the last audio and video reproduction data, 
Fig. 49 is a flow chart showing the operation of the 
DVD encoder of Fig. 26, 

Fig. 50 is a flow chart showing details of the multi- 
angle non-seamless switching control routine of 
Fig. 49, 

Fig. 51 is a flow chart showing details of the multi- 
angle seamless switching control routine of Fig. 49, 
Fig. 52 is a flow chart showing details of the parental 
lock sub-routine of Fig. 49, 
Fig. 53 is a flow chart showing details of the single 
scene subroutine of Fig 49, 
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Figs. 54 and 55 are graphs showing decoding infor- 
mation table produced by the decoding system con- 
troller of Fig. 26, 

Fig. 56 is a flow chart showing the operation of the 
DVD decoder DCD of Fig. 26, 5 
Fig. 57 is a flow chart showing details of reproduc- 
tion extracted PGC routing of Fig. 56, 
Fig. 58 is a flow chart showing details of decoding 
data process of Fig. 57, performed by the stream 
buffer, 10 
Fig. 59 is a flow chart showing details of the decoder 
synchronization process of Fig. 58, 
Fig. 60 is a flow chart showing an during non-seam- 
less reproduction operation of the STC selection 
controller of Fig. 59, 15 
Fig. 61 is a flow chart showing the operation of the 
STC selection controller of Fig. 39 during seamless 
reproduction operation, 

Fig. 62 is a flow chart showing the data transfering 
operation of Fig. 57, 20 
Fig. 63 is a flow chart showing details of the non 
multi-angle decoding process of Fig. 62, 
Fig. 64 is a flow chart showing details of the non- 
multi-angled interleave process of Fig. 63, 
Fig. 65 is a flow chart showing details of the non- 25 
multi-angled contiguous block process of Fig. 63, 
Fig. 66 is a flow chart showing an modification of 
Fig. 63, 

Fig. 67 is a flow chart showing details of the seam- 
less multi-angle decoding process of Fig. 62, 30 
Fig. 68 is a flow chart showing details of non-seam- 
less multi-angle decoding process of Fig. 62, 
Fig. 69 is a block diagram showing details of the 
stream buffer of Fig. 26, 

Fig. 70 is a flow chart showing the encode param- 35 
eters generating operation for a system stream con- 
taining a single scene, 

Fig. 71 is a graph schematically showing an actual 
arrangement of data blocks recorded to a data re- 
cording track on a recording medium according to 40 
the present invention, 

Fig. 72 is a graph schematically showing contigu- 
ous block regions and interleaved block regions ar- 
ray, 

Fig. 73 is a graph schematically showing a content 45 
of a VTS title VC3S according to the present inven- 
tion, and 

Fig. 74 is a graph schematically showing an internal 
data structure of the interleaved block regions ac- 
cording to the present invention. 50 

Best Mode for Carrying Out the Invention 

[0021] The prevenc invention is detailedly described 
with reference to the accompanying drawings. 55 



Data structure of the authoring system 

[0022] The logic structure of the multimedia data bit- 
stream processed using the recording apparatus, re- 
cording medium, reproduction apparatus, and authoring 
system according to the present invention is described 
first below with reference to Fig. 1 . 
[0023] In this structure, one title refers to the combi- 
nation of video and audio data expressing program con- 
tent recognized by a user for education, entertainment, 
or other purpose. Referenced to a motion picture (mov- 
ie), one title may correspond to the content of an entire 
movie, or to just one scene within said movie. 
[0024] A video title set (VTS) comprises the bitstream 
data containing the information for a specific number of 
titles. More specifically, each VTS comprises the video, 
audio, and other reproduction data representing the 
content of each title in the set, and control data for con- 
trolling the content data. 

[0025] The video zone VZ is the video data unit proc- 
essed by the authoring system, and comprises a spe- 
cific number of video title sets. More specifically, each 
video zone is a linear sequence of K + 1 video title sets 
numbered VTS #0 - VTS #K where K is an integer value 
of zero or greater. One video title set, preferably the first 
video title set VTS #0, is used as the video manager 
describing the content information of the titles contained 
in each video title set. 

[0026] The multimedia bitstream MBS is the largest 
control unit of the multimedia data bitstream handled by 
the authoring system of the present invention, and com- 
prises plural video zones VZ. 

Authoring encoder EC 

[0027] A preferred embodiment of the authoring en- 
coder EC according to the present invention for gener- 
ating a new multimedia bitstream MBS by re-encoding 
the original multimedia bitstream MBS according to the 
scenario desired by the user is shown in Fig. 2. Note 
that the original multimedia bitstream MBS comprises a 
video stream St1 containing the video information, a 
sub-picture stream St3 containing caption text and other 
auxiliary video information, and the audio stream St5 
containing the audio information. 
[0028] The video and audio streams are the bit- 
streams containing the video and audio information ob- 
tained from the source within a particular period of time. 
The sub-picture stream is a bitstream containing mo- 
mentary video information relevant to a particular scene. 
The sub-picture data encoded to a single scene may be 
captured to video memory and displayed continuously 
from the video memory for plural scenes as may be nec- 
essary. 

[0029] When this multimedia source data St1, St3, 
and St5 is obtained from a live broadcast, the video and 
audio signals are supplied in real-time from a video cam- 
era or other imaging source; when the multimedia 
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source data is reproduced from a video tape or other 
recording medium, the audio and video signals are not 
real-time signals. 

[0030] While the multimedia source stream is shown 
in Fig. 2 as comprising these three source signals, this 
is for convenience only, and it should be noted that the 
multimedia source stream may contain more than three 
types of source signals, and may contain source data 
for different titles. Multimedia source data with audio, 
video, and sub-picture data for plural titles are referred 
to below as multi-title streams. 
[0031] As shown in Fig. 2, the authoring encoder EC 
comprises a scenario editor 100, encoding system con- 
troller 200, video encoder 300, video stream buffer 400, 
sub-picture encoder 500, sub-picture stream buffer 600, 
audio encoder 700, audio stream buffer 800, system en- 
coder 900, video zone formatter 1300, recorder 1200, 
and recording medium M. 

[0032] The video zone formatter 1 300 comprises vid- 
eo object (VOB) buffer 1000, formatter 1100, and vol- 
ume and file structure formatter 1400. 
[0033] The bitstream encoded by the authoring en- 
coder EC of the present embodiment is recorded by way 
of example only to an optical disk. 
[0034] The scenario editor 100 of the authoring en- 
coder EC outputs the scenario data, i.e., the user-de- 
fined editing instructions. The scenario data controls ed- 
iting the corresponding parts of the multimedia bitstream 
MBS according to the user's manipulation of the video, 
sub-picture, and audio components of the original mul- 
timedia title. This scenario editor 100 preferably com- 
prises a display, speaker(s), keyboard, CPU, and 
source stream buffer. The scenario editor 100 is con- 
nected to an external multimedia bitstream source from 
which the multimedia source data St1 , St3, and St5 are 
supplied. 

[0035] The user is thus able to reproduce the video 
and audio components of the multimedia source data 
using the display and speaker to confirm the content of 
the generated title. The user is then able to edit the title 
content according to the desired scenario using the key- 
board, mouse, and other command input devices while 
confirming the content of the title on the display and 
speakers. The result of this multimedia data manipula- 
tion is the scenario data St7. 

[0036] The scenario data St7 is basically a set of in- 
structions describing what source data is selected from 
all or a subset of the source data containing plural titles 
within a defined time period, and how the selected 
source data is reassembled to reproduce the scenario 
(sequence) intended by the user. Based on the instruc- 
tions received through the keyboard or other control de- 
vice, the CPU codes the position, length, and the relative 
time-based positions of the edited parts of the respec- 
tive multimedia source data streams St1, St3, and St5 
to generate the scenario data St7. 
[0037] The source stream buffer has a specific capac- 
ity, and is used to delay the multimedia source data 



streams St1 , St3, and St5 a known time Td and then 
output streams St1 , St3, and St5. 
[0038] This delay is required for synchronization with 
the editor encoding process. More specifically, when da- 
5 ta encoding and user generation of scenario data St7 
are executed simultaneously, i.e., when encoding im- 
mediately follows editing, time Td is required to deter- 
mine the content of the multimedia source data editing 
process based on the scenario data St7 as will be de- 
10 scribed further below. As a result, the multimedia source 
data must be delayed by time Td to synchronize the ed- 
iting process during the actual encoding operation. Be- 
cause this delay time Td is limited to the time required 
to synchronize the operation of the various system corn- 
's ponents in the case of sequential editing as described 
above, the source stream buffer is normally achieved by 
means of a high speed storage medium such as semi- 
conductor memory. 

[0039] During batch editing in which all multimedia 

20 source data is encoded at once ("batch encoded") after 
scenario data St7 is generated for the complete title, de- 
lay time Td must be long enough to process the com- 
plete title or longer. In this case, the source stream buffer 
may be a low speed, high capacity storage medium such 

25 as video tape, magnetic disk, or optical disk. 

[0040] The scructure (type) of media used for the 
source stream buffer may therefore be determined ac- 
cording to the delay time Td required and the allowable 
manufacturing cost. 

30 [0041] The encoding system controller 200 is con- 
nected to the scenario editor 1 00 and receives the sce- 
nario data St7 therefrom. Based on the time-base posi- 
tion and length information of the edit segment con- 
tained in the scenario data St7, the encoding system 

35 controller 200 generates the encoding parameter sig- 
nals St9, St11, and St13 for encoding the edit segment 
of the multimedia source data. The encoding signals 
St9, St1 1 , and St1 3 supply the parameters used for vid- 
eo, sub-picture, and audio encoding, including the en- 

40 coding start and end timing. Note that multimedia source 
data St1 , St3, and St5 are output after delay time Td by 
the source stream buffer, and are therefore synchro- 
nized to encoding parameter signals St9, St11, and 
St13. 

45 [0042] More specifically, encoding parameter signal 
St9 is the video encoding signal specifying the encoding 
timing of video stream St1 to extract the encoding seg- 
ment from the video stream St1 and generate the video 
encoding unit. Encoding parameter signal St11 is like- 

50 wise the sub-picture stream encoding signal used to 
generate the sub-picture encoding unit by specifying the 
encoding timing for sub-picture stream St3. Encoding 
parameter signal St1 3 is the audio encoding signal used 
to generate the audio encoding unit by specifying the 

55 encoding timing for audio stream St5. 

[0043] Based on the time-base relationship between 
the encoding segments of streams St1 , St3, and St5 in 
the multimedia source data contained in scenario data 
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St7, the encoding system controller 200 generates the 
timing signals St21, St23, and St25 arranging the en- 
coded multimedia-encoded stream in the specified time- 
base relationship. 

[0044] The encoding system controller 200 also gen- 
erates the reproduction time information IT defining the 
reproduction time of the title editing unit (video object, 
VOB), and the stream encoding data St33 defining the 
system encode parameters for multiplexing the encod- 
ed multimedia stream containing video, audio, and sub- 
picture data. Note that the reproduction time information 
IT and stream encoding data St33 are generated for the 
video object VOB of each title in one video zone VZ. 
[0045] The encoding system controller 200 also gen- 
erates the title sequence control signal St39, which de- 
clares the formatting parameters for formatting the title 
editing units VOB of each of the streams in a particular 
time-base relationship as a multimedia bitstream. More 
specifically, the title sequence control signal St39 is 
used to control the connections between the title editing 
units (VOB) of each title in the multimedia bitstream 
MBS, or to control the sequence of the interleaved title 
editing unit (VOBs) interleaving the title editing units 
VOB of plural reproduction paths. 
[0046] The video encoder 300 is connected to the 
source stream buffer of the scenario editor 1 00 and to 
the encoding system controller 200, and receives there- 
from the video stream St1 and video encoding parame- 
ter signal St9, respectively. Encoding parameters sup- 
plied by the video encoding signal St9 include the en- 
coding start and end timing, bit rate, the encoding con- 
ditions for the encoding start and end, and the material 
type. Possible material types include NTSC or PAL vid- 
eo signal, and telecine converted material. Based on the 
video encoding parameter signal St9, the video encoder 
300 encodes a specific part of the video stream St1 to 
generate the encoded video stream St15. 
[0047] The sub-picture encoder 500 is similarly con- 
nected to the source stream buffer of the scenario editor 
100 and to the encoding system controller 200, and re- 
ceives therefrom the sub-picture stream St3 and sub- 
picture encoding parameter signal St11, respectively. 
Based on the sub-picture encoding parameter signal 
St11, the sub-picture encoder 500 encodes a specific 
part of the sub-picture stream St3 to generate the en- 
coded sub-picture stream St17. 
[0048] The audio encoder 700 is also connected to 
the source stream buffer of the scenario editor 100 and 
to the encoding system controller 200, and receives 
therefrom the audio stream St5 and audio encoding pa- 
rameter signal St13, which supplies the encoding start 
and end timing. Based on the audio encoding parameter 
signal St13, the audio encoder 700 encodes a specific 
part of the audio stream St5 to generate the encoded 
audio stream St19. 

[0049] The video stream buffer 400 is connected to 
the video encoder 300 and to the encoding system con- 
troller 200. The video stream buffer 400 stores the en- 



coded video stream St15 input from the video encoder 
300, and outputs the stored encoded video stream St15 
as the time-delayed encoded video stream St27 based 
on the timing signal St21 supplied from the encoding 

5 system controller 200. 

[0050] The sub-picture stream buffer 600 is similarly 
connected to the sub-picture encoder 500 and to the en- 
coding system controller 200. The sub-picture stream 
buffer 600 stores the encoded sub-picture stream St17 

10 output from the sub-picture encoder 500, and then out- 
puts the stored encoded sub-picture stream St17 as 
time-delayed encoded sub-picture stream St29 based 
on the timing signal St23 supplied from the encoding 
system controller 200. 

15 [0051] The audio stream buffer 800 is similarly con- 
nected to the audio encoder 700 and to the encoding 
system controller 200. The audio stream buffer 800 
stores the encoded audio stream St1 9 input from the 
audio encoder 700, and then outputs the encoded audio 

20 stream St1 9 as the time-delayed encoded audio stream 
St31 based on the timing signal St25 supplied from the 
encoding system controller 200. 
[0052] The system encoder 900 is connected to the 
video stream buffer 400, sub-picture stream buffer 600, 

25 audio stream buffer 800, and the encoding system con- 
troller 200, and is respectively supplied thereby with the 
time-delayed encoded video stream St27, time-delayed 
encoded sub-picture stream St29, time-delayed encod- 
ed audio stream St31, and the stream encoding data 

30 st33. Note that the system encoder 900 is a multiplexer 
that multiplexes the time-delayed streams St27, St29, 
and St31 based on the stream encoding data St33 (tim- 
ing signal) to generate title editing unit (VOB) St35. The 
stream encoding data St33 contains the system encod- 
es ing parameters, including the encoding start and end 
timing. 

[0053] The video zone formatter 1 300 is connected to 
the system encoder 900 and the encoding system con- 
troller 200 from which the title editing unit (VOB) St35 

40 and title sequence control signal St39 (timing signal) are 
respectively supplied. The title sequence control signal 
St39 contains the formatting start and end timing, and 
the formatting parameters used to generate (format) a 
multimedia bitstream MBS. The video zone formatter 

45 1 300 rearranges the title editing units (VOB) St35 in one 
video zone VZ in the scenario sequence defined by the 
user based on the title sequence control signal St39 to 
generate the edited multimedia stream data St43. 
[0054] The multimedia bitstream MBS St43 edited ac- 

50 cording to the user-defined scenario is then sent to the 
recorder 1 200. The recorder 1 200 processes the edited 
multimedia stream data St43 to the data stream St45 
format of the recording medium M, and thus records the 
formatted data stream St45 to the recording medium M. 

55 Note that the multimedia bitstream MBS recorded to the 
recording medium M contains the volume file structure 
VFS, which includes the physical address of the data on 
the recording medium generated by the video zone for- 
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matter 1 300. 

[0055] Note that the encoded multimedia bitstream 
MBS St35 may be output directly to the decoder to im- 
mediately reproduce the edited title content. It will be 
obvious that the output multimedia bitstream MBS will 
not in this case contain the volume file structure VFS. 

Authoring decoder DC 

[0056] A preferred embodiment of the authoring de- 
coder DC used to decode the multimedia bitstream MBS 
edited by the authoring encoder EC of the present in- 
vention, and thereby reproduce the content of each title 
unit according to the user-defined scenario, is described 
next below with reference to Fig. 3. Note that in the pre- 
ferred embodiment described below the multimedia bit- 
stream St45 encoded by the authoring encoder EC is 
recorded to the recording medium M. 
[0057] As shown in Fig. 3, the authoring decoder DC 
comprises a multimedia bitstream producer 2000, sce- 
nario selector 2100, decoding system controller 2300, 
stream buffer 2400, system decoder 2500, video buffer 
2600, sub-picture buffer 2700, audio buffer 2800, syn- 
chronizer 2900, video decoder 3800, sub-picture decod- 
er 3100, audio decoder 3200, synthesizer 3500, video 
data output terminal 3600, and audio data output termi- 
nal 3700. 

[0058] The bitstream producer 2000 comprises a re- 
cording media drive unit 2004 for driving the recording 
medium M; a reading head 2006 for reading the infor- 
mation recorded to the recording medium M and pro- 
ducing the binary read signal St57; a signal processor 
2008 for variously processing the read signal St57 to 
generate the reproduced bitstream St61; and a repro- 
duction controller 2002. 

[0059] The reproduction controller 2002 is connected 
to the decoding system controller 2300 from which the 
multimedia bitstream reproduction control signal St53 is 
supplied, and in turn generates the reproduction control 
signals St55 and St59 respectively controlling the re- 
cording media drive unit (motor) 2004 and signal proc- 
essor 2008. 

[0060] So that the user-defined video, sub-picture, 
and audio portions of the multimedia title edited by the 
authoring encoder EC are reproduced, the authoring de- 
coder DC comprises a scenario selector 21 0C for se- 
lecting and reproducing the corresponding scenes 
(titles) . The scenario selector 21 00 then outputs the se- 
lected titles as scenario data to the authoring decoder 
DC. 

[0061] The scenario selector 2100 preferably com- 
prises a keyboard, CPU, and monitor. Using the key- 
board, the user then inputs the desired scenario based 
on the content of the scenario input by the authoring en- 
coder EC. Based on the keyboard input, the CPU gen- 
erates the scenario selection data St51 specifying the 
selected scenario. The scenario selector 2100 is con- 
nected by an infrared communications device, for ex- 



ample, to the decoding system controller 2300, to which 
it inputs the scenario selection data St51 . 
[0062] Based on the scenario selection data St51 , the 
decoding system controller 2300 then generates the bit- 

5 stream reproduction control signal St53 controlling the 
operation of the bitstream producer 2000. 
[0063] The stream buffer 2400 has a specific buffer 
capacity used to temporarily store the reproduced bit- 
stream St61 input from the bitstream producer 2000, ex- 

10 tract the address information and initial synchronization 
data SCR (system clock reference) for each stream, and 
generate bitstream control data St63. The stream buffer 
2400 is also connected to the decoding system control- 
ler 2300, to which it supplies the generated bitstream 

15 control data St63. 

[0064] The synchronizer 2900 is connected to the de- 
coding system controller 2300 from which it receives the 
system clock reference SCR contained in the synchro- 
nization control data St81 to set the internal system 

20 clock STC and supply the reset system clock St79 to the 
decoding system controller 2300. 
[0065] Based on this system clock St79, the decoding 
system controller 2300 also generates the stream read 
signal St65 at a specific interval and outputs the read 

25 signal St65 to the stream buffer 2400. 

[0066] Based on the supplied read signal St65, the 
stream buffer 2400 outputs the reproduced bitstream 
St61 at a specific interval to the system decoder 2500 
as bitstream St67. 

30 [0067] Based on the scenario selection data St51 , the 
decoding system controller 2300 generates the decod- 
ing signal St69 defining the stream Ids for the video, sub- 
picture, and audio bitstreams corresponding to the se- 
lected scenario, and outputs to the system decoder 

35 2500. 

[0068] Based on the instructions contained in the de- 
coding signal St69, the system decoder 2500 respec- 
tively outputs the video, sub-picture, and audio bit- 
streams input from the stream buffer 2400 to the video 

40 buffer 2600, sub-picture buffer 2700, and audio buffer 
2800 as the encoded video stream St71 , encoded sub- 
picture stream St73, and encoded audio stream St75. 
[0069] The system decoder 2500 detects the presen- 
tation time stamp PTS and decoding time stamp DTS of 

45 the smallest control unit in each bitstream St67 to gen- 
erate the time information signal St77. This time infor- 
mation signal St77 is supplied to the synchronizer 2900 
through the decoding system controller 2300 as the syn- 
chronization control data St81 , 

so [0070] Based on this synchronization control data 
St81, the synchronizer 2900 determines the decoding 
start timing whereby each of the bitstreams will be ar- 
ranged in the correct sequence after decoding, and then 
generates and inputs the video stream decoding start 

55 signal St89 to the video decoder 3800 based on this de- 
coding timing. The synchronizer 2900 also generates 
and supplies the sub-picture decoding start signal St91 
and audio stream decoding start signal St93 to the sub- 



15 



EP 0 920 203 B1 



16 



picture decoder 3100 and audio decoder 3200, respec- 
tively. 

[0071] The video decoder 3800 generates the video 
output request signal St84 based on the video stream 
decoding start signal St89, and outputs to the video buff- 
er 2600. In response to the video output request signal 
St84, the video buffer 2600 outputs the video stream 
St83 to the video decoder 3800. The video decoder 
3800 thus detects the presentation time information 
contained in the video stream St83, and disables the vid- 
eo output request signal St84 when the length of the re- 
ceived video stream St83 is equivalent to the specified 
presentation time. A video stream equal in length to the 
specified presentation time is thus decoded by the video 
decoder 3800, which outputs the reproduced video sig- 
nal St1 04 to the synthesizer 3500. 
[0072] The sub-picture decoder 31 00 similarly gener- 
ates the sub-picture output request signal St86 based 
on the sub-picture decoding start signal St91, and out- 
puts to the sub-picture buffer 2700. In response to the 
sub-picture output request signal St86, the sub-picture 
buffer 2700 outputs the sub-picture stream St85 to the 
sub-picture decoder 3100. Based on the presentation 
time information contained in the sub-picture stream 
St85, the sub-picture decoder 31 00 decodes a length of 
the sub-picture stream St85 corresponding to the spec- 
ified presentation time to reproduce and supply to the 
synthesizer 3500 the sub-picture signal St99. 
[0073] The synthesizer 3500 superimposes the video 
signal St104 and sub-picture signal St99 to generate 
and output the multi-picture video signal St105 to the 
video data output terminal 3600. 
[0074] The audio decoder 3200 generates and sup- 
plies to the audio buffer 2800 the audio output request 
signal St88 based on the audio stream decoding start 
signal St93. The audio buffer 2800 thus outputs the au- 
dio stream St87 to the audio decoder 3200. The audio 
decoder 3200 decodes a length of the audio stream 
St87 corresponding to the specified presentation time 
based on the presentation time information contained in 
the audio stream St87, and outputs the decoded audio 
stream St101 to the audio data output terminal 3700. 
[0075] It is thus possible to reproduce a user-defined 
multimedia bitstream MBS in real-time according to a 
user-defined scenario. More specifically, each time the 
user selects a different scenario, the authoring decoder 
DC is able to reproduce the title content desired by the 
user in the desired sequence by reproducing the multi- 
media bitstream MBS corresponding to the selected 
scenario. 

[0076] It is therefore possible by means of the author- 
ing system of the present invention to generate a multi- 
media bitstream according to plural user-defined sce- 
narios by real-time orbatch encoding multimedia source 
data in a manner whereby the substreams of the small- 
est editing units (scenes), which can be divided into plu- 
ral substreams, expressing the basic title content are ar- 
ranged in a specific time-base relationship. 



[0077] The multimedia bitstream thus encoded can 
then be reproduced according to the one scenario se- 
lected from among plural possible scenarios. It is also 
possible to change scenarios while playback is in 

5 progress, i.e., to select a different scenario and dynam- 
ically generate a new multimedia bitstream according to 
the most recently selected scenario. It is also possible 
to dynamically select and reproduce any of plural 
scenes while reproducing the title content according to 

10 a desired scenario. 

[0078] It is therefore possible by means of the author- 
ing system of the present invention to encode and not 
only reproduce but to repeatedly reproduce a multime- 
dia bitstream MBS in real-time. 

15 [0079] A detail of the authoring system is disclosed 
Japanese Patent Application filed September 27, 1996, 
and entitled and assigned to the same assignee as the 
present application. 



[0080] An example of a digital video disk (DVD) with 
only one recording surface (a single-sided DVD) is 
shown in Fig. 4. 

25 [0081] The DVD recording medium RC1 in the pre- 
ferred embodiment of the invention comprises a data re- 
cording surface RS1 to and from which data is written 
and read by emitting laser beam LS, and a protective 
layer PL1 covering the data recording surface RS1 . A 

30 backing layer BL1 is also provided on the back of data 
recording surface RS1. The side of the disk on which 
protective layer PL1 is provided is therefore referred to 
below as side SA (commonly "side A"), and the opposite 
side (on which the backing layer BL1 is provided) is re- 

35 ferred to as side SB ("side B") . Note that digital video 
disk recording media having a single data recording sur- 
face RS1 on only one side such as chis DVD recording 
medium RC1 is commonly called a single-sided single 
layer disk. 

40 [0082] A detailed illustration of area C1 in Fig. 4 is 
shown in Fig. 5. Note that the data recording surface 
RS1 is formed by applying a metallic thin film or other 
reflective coating as a data layer 41 09 on a first trans- 
parent layer 41 08 having a particular thickness T1 . This 

45 first transparent layer 41 08 also functions as the protec- 
tive layer PL1 . A second transparent substrate 4111 of 
a thickness T2 functions as the backing layer BL1 , and 
is bonded to the first transparent layer 41 08 by means 
of an adhesive layer 4110 disposed therebetween. 

50 [0083] A printing layer 4112 for printing a disk label 
may also be disposed on the second transparent sub- 
strate 4111 as necessary. The printing layer 41 12 does 
not usually cover the entire surface area of the second 
transparent substrate 4111 (backing layer BL1 ), but only 

55 the area needed to print the text and graphics of the disk 
label. The area of second transparent substrate 41 1 1 to 
which the printing layer 4112 is not formed may be left 
exposed. Light reflected from the data layer 4109 (me- 



20 DVD 
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tallic thin film) forming the data recording surface RS1 
can therefore be directly observed where the label is not 
printed when the digital video disk is viewed from side 
SB. As a result, the background looks like a silver-white 
over which the printed text and graphics float when the 
metallic thin film is an aluminum thin film, for example. 
[0084] Note that it is only necessary to provide the 
printing layer 4112 where needed for printing, and it is 
not necessary to provide the printing layer 41 12 over the 
entire surface of the backing layer BL1 . 
[0085] A detailed illustration of area C2 in Fig. 5 is 
shown in Fig. 6. Pits and lands are molded to the com- 
mon contact surface between the first transparent layer 
4108 and the data layer 4109 on side SA from which 
data is read by emitting a laser beam LS, and data is 
recorded by varying the lengths of the pits and lands (i. 
e., the length of the intervals between the pits) . More 
specifically, the pit and land configuration formed on the 
first transparent layer 41 08 is transferred to the data lay- 
er 41 09. The lengths of the pits and lands is shorter, and 
the pitch of the data tracks formed by the pit sequences 
is narrower, than with a conventional Compact Disc 
(CD). The surface recording density is therefore greatly 
improved. 

[0086] Side SA of the first transparent layer 41 08 on 
which data pits are not formed is a flat surface. The sec- 
ond transparent substrate 41 1 1 is for reinforcement, and 
is a transparent panel made from the same material as 
the first transparent layer 4108 with both sides flat. 
Thicknesses T1 and T2 are preferably equal and com- 
monly approximately 0.6 mm, but the invention shall not 
be so limited. 

[0087] As with a CD, information is read by irradiating 
the surface with a laser beam LS and detecting the 
change in the reflectivity of the light spot. Because the 
objective lens aperture NA can be large and the wave- 
length 1 of the light beam small in a digital video disk 
system, the diameter of the light spot Ls used can be 
reduced to approximately 1/1 .6 the light spot needed to 
read a CD. Note that this means the resolution of the 
laser beam LS in the DVD system is approximately 1.6 
times the resolution of a conventional CD system. 
[0088] The optical system used to read data from the 
digital video disk uses a short 650 nm wavelength red 
semiconductor laser and an objective lens with a 0.6 mm 
aperture NA. By thus also reducing the thickness T of 
the transparent panels to 0.6 mm, more than 5 GB of 
data can be stored to one side of a 120 mm diameter 
optical disk. 

[0089] It is therefore possible to store motion picture 
(video) images having an extremely large per unit data 
size to a digital video disk system disk without losing 
image quality because the storage capacity of a single- 
sided, single-layer recording medium RC1 with one data 
recording surface RS1 as thus described is nearly ten 
times the storage capacity of a conventional CD. As a 
result, while the video presentation time of a conven- 
tional CD system is approximately 74 minutes if image 



quality is sacrificed, high quality video images with a vid- 
eo presentation time exceeding two hours can be re- 
corded to a DVD. 

[0090] The digital video disk is therefore well-suited 

5 as a recording medium for video images. 

[0091] A digital video disk recording medium with plu- 
ral recording surfaces RS as described above is shown 
in Figs. 7 and 8. The DVD recording medium RC2 shown 
in Fig. 7 comprises two recording surfaces, i.e., first re- 

10 cording surface RS1 and semi-transparent second re- 
cording surface RS2, on the same side, i.e. side SA, of 
the disk. Data can be simultaneously recorded or repro- 
duced from these two recording surfaces by using dif- 
ferent laser beams LS1 and LS2 for the first recording 

15 surface RS1 and the second recording surface RS2. It 
is also possible to read/write both recording surfaces 
RS1 and RS2 using only one of the laser beams LS1 or 
LS2. Note that recording media thus comprised are 
called "single-side, dual-layer disks." 

20 [0092] It should also be noted that while two recording 
surfaces RS1 and RS2 are provided in this example, it 
is also possible to produce digital video disk recording 
media having more than two recording surfaces RS. 
Disks thus comprised are known as "single-sided, multi- 

25 layer disks." 

[0093] Though comprising two recording surfaces 
similarly to the recording media shown in Fig. 7, the DVD 
recording medium RC3 shown in Fig. 8 has the record- 
ing surfaces on opposite sides of the disk, i. e., has the 

30 first data recording surface RS1 on side SA and the sec- 
ond data recording surface RS2 on side SB. It will also 
be obvious that while only two recording surfaces are 
shown on one digital video disk in this example, more 
than two recording surfaces may also be formed on a 

35 double-sided digital video disk. As with the recording 
medium shown in Fig. 7, it is also possible to provide 
two separate laser beams LS1 and LS2 for recording 
surfaces RS1 and RS2, or to read/write both recording 
surfaces RS1 and RS2 using a single laser beam. Note 

40 that this type of digital video disk is called a "double- 
sided, dual-layer disk." It will also be obvious that a dou- 
ble-sided digital video disk can be comprised with two 
or more recording surfaces per side. This type of disk is 
called a "double-sided, multi-layer disk." 

45 [0094] A plan view from the laser beam LS irradiation 
side of the recording surface RS of the DVD recording 
medium RC is shown in Fig. 9 and Fig. 10. Note that a 
continuous spiral data recording track TR is provided 
from the inside circumference to the outside circumfer- 

50 ence of the DVD. The data recording track TR is divided 
into plural sectors each having the same known storage 
capacity. Note that for simplicity only the data recording 
track TR is shown in Fig. 9 with more than three sectors 
per revolution. 

55 [0095] As shown in Fig. 9, the data recording track TR 
is normally formed clockwise inside to outside (see ar- 
row DrA) from the inside end point IA at the inside cir- 
cumference of disk RCA to the outside end point OA at 
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the outside circumference of the disk with the disk RCA 
rotating counterclockwise RdA. This type of disk RCA is 
called a clockwise disk, and the recording track formed 
thereon is called a clockwise track TRA. 
[0096] Depending upon the application, the recording 
track TRB may be formed clockwise from outside to in- 
side circumference (see arrow DrB in Fig. 10) from the 
outside end point OB at the outside circumference cf 
disk RCB to the inside end point IB at the inside circum- 
ference of the disk with the disk RCB rotating clockwise 
RdB. Because the recording track appears to wind 
counterclockwise when viewed from the inside circum- 
ference to the outside circumference on disks with the 
recording track formed in the direction of arrow DrB, 
these disks are referred to as counterclockwise disk 
RCB with counterclockwise track TRB to distinguish 
them from disk RCA in Fig. 9. Note that track directions 
DrA and DrB are the track paths along which the laser 
beam travels when scanning the tracks for recording 
and playback. Direction of disk rotation RdA in which 
disk RCA turns is thus opposite the direction of track 
path DrA, and direction of disk rotation RdB in which disk 
RCB turns is thus opposite the direction of track path 
DrB. 

[0097] An exploded view of the single-sided, dual-lay- 
er disk RC2 shown in Fig. 7 is shown as disk RC2o in 
Fig. 1 1 . Note that the recording tracks formed on the two 
recording surfaces run in opposite directions. Specifi- 
cally, a clockwise recording track TRA as shown in Fig. 
9 is formed in clockwise direction DrA on the (lower) first 
data recording surface RS1 , and a counterclockwise re- 
cording track TRB formed in counterclockwise direction 
DrB as shown in Fig. 1 0 is provided on the (upper) sec- 
ond data recording surface RS2. As a result, the outside 
end points OA and OB of the first and second (top and 
bottom) tracks are at the same radial position relative to 
the center axis of the disk RC2o. Note that track paths 
DrA and DrB of tracks TR are also the data read/write 
directions to disk RC. The first and second (top and bot- 
tom) recording tracks thus wind opposite each other with 
this disk RC, i.e., the track paths DrA and DrB of the top 
and bottom recording layers are opposite track paths. 
[0098] Opposite track path type, single-sided, dual- 
layer disks RC2o rotate in direction RdA corresponding 
to the first recording surface RS1 with the laser beam 
LS travelling along track path DrA to trace the recording 
track on the first recording surface RS1 . When the laser 
beam LS reaches the outside end point OA, the laser 
beam LS can be refocused to end point OB on the sec- 
ond recording surface RS2 to continue tracing the re- 
cording track from the first to the second recording sur- 
face uninterrupted. The physical distance between the 
recording tracks TRA and TRB on the first and second 
recording surfaces RS1 and RS2 can thus be instanta- 
neously eliminated by simply adjusting the focus of the 
laser beam LS. 

[0099] It is therefore possible with an opposite track 
path type, single-sided, dual-layer disk RC2o to easily 



process the recording tracks disposed to physically dis- 
crete top and bottom recording surfaces as a single con- 
tinuous recording track. It is therefore also possible in 
an authoring system as described above with reference 

5 to Fig. 1 to continuously record the multimedia bitstream 
MBS that is the largest multimedia data management 
unit to two discrete recording surfaces RS1 and RS2 on 
a single recording medium RC2o. 
[0100] It should be noted that the tracks on recording 

io surfaces RS1 and RS2 can be wound in the directions 
opposite those described above, i.e., the counterclock- 
wise track TRB may be provided on the first recording 
surface RS1 and the clockwise track TRA on the second 
recording surface RS2. In this case the direction of disk 

15 rotation is also changed to a clockwise rotation RdB, 
thereby enabling the two recording surfaces to be used 
as comprising a single continuous recording track as de- 
scribed above. For simplification, a further example of 
this type of disk is therefore neither shown nor described 

20 below. 

[0101] It is therefore possible by thus constructing the 
digital video disk to record the multimedia bitstream 
MBS for a feature-length title to a single opposite track 
path type, single-sided, dual-layer disk RC2o. Note that 

25 this type of digital video disk medium is called a single- 
sided dual-layer disk with opposite track paths. 
[0102] Another example of the single-sided, dual-lay- 
er DVD recording medium RC2 shown in Fig. 7 is shown 
as disk RC2p in Fig. 12. The recording tracks formed on 

30 both first and second recording surfaces RS1 and RS2 
are clockwise tracks TRA as shown in Fig. 9. In this 
case, the single-sided, dual-layer disk RC2p rotates 
counterclockwise in the direction of arrow RdA, and the 
direction of laser beam LS travel is the same as the di- 

35 rection of the track spiral, i.e., the track paths of the top 
and bottom recording surfaces are mutually parallel 
(parallel track paths). The outside end points OA of both 
top and bottom tracks are again preferably positioned 
at the same radial position relative to the center axis of 

40 the disk RC2p as described above. As also described 
above with disk RC2o shown in Fig. 1 1 , the access point 
can be instantaneously shifted from outside end point 
OA of track TRA on the first recording surface RS1 to 
the outside end point OA of track TRA on the second 

45 recording surface RS2 by appropriately adjusting the fo- 
cus of the laser beam LS at outside end point OA. 
[01 03] However, for the laser beam LS to continuous- 
ly access the clockwise recording track TRA on the sec- 
ond recording surface RS2, the recording medium 

50 RC2p must be driven in the opposite direction (clock- 
wise, opposite direction RdA). Depending on the radial 
position of the laser beam LS, however, it is inefficient 
to change the rotational direction of the recording medi- 
um. As shown by the diagonal arrow in Fig. 1 2, the laser 

55 beam LS is therefore moved from the outside end point 
OA of the track on the first recording surface RS1 to the 
inside end point IA of the track on the second recording 
surface RS2 to use these physically discrete recording 



. 21 



EP 0 920 203 B1 



22 



tracks as one logically continuous recording track. 
[0104] Rather than using the recording tracks on top 
and bottom recording surfaces as one continuous re- 
cording track, it is also possible to use the recording 
tracks to record the multimedia bitstreams MBS for dif- 
ferent titles. This type of digital video disk recording me- 
dium is called a "single-sided, dual-layer disk with par- 
allel track paths." 

[0105] Note that if the direction of the tracks formed 
on the recording surfaces RS1 and RS2 is opposite that 
described above, i.e., counterclockwise recording 
tracks TRB are formed, disk operation remains the 
same as that described above except for the direction 
of disk rotation, which is clockwise as shown by arrow 
RdB. 

[0106] Whether using clockwise or counterclockwise 
recording tracks, the single-sided, dual-layer disk RC2p 
with parallel track paths thus described is well-suited to 
storing on a single disk encyclopedia and similar multi- 
media bitstreams comprising multiple titles that are fre- 
quently and randomly accessed. 
[0107] An exploded view of the dual-sided single-lay- 
er DVD recording medium RC3 comprising one record- 
ing surface layer RS1 and RS2 on each side as shown 
in Fig. 8 is shown as DVD recording medium RC3s in 
Fig. 13. Clockwise recording track TRA is provided on 
the one recording surface RS1 , and a counterclockwise 
recording track TRB is provided on the other recording 
surface RS2. As in the preceding recording media, the 
outside end points OA and OB of the recording tracks 
on each recording surface are preferably positioned at 
the same radial position relative to the center axis of the 
DVD recording medium RC3s. 

[0108] Note that while the recording tracks on these 
recording surfaces RS1 and RS2 rotate in opposite di- 
rections, the track paths are symmetrical. This type of 
recording medium is therefore known as a doubie-sided 
dual layer disk with symmetrical track paths. This dou- 
ble-sided dual layer disk with symmetrical track paths 
RC3s rotates in direction RdA when reading/writing the 
first recording surface RS1 . As a result, the track path 
on the second recording surface RS2 on the opposite 
side is opposite the direction DrB in which the track 
winds, i.e., direction DrA. Accessing both recording sur- 
faces RS1 and RS2 using a single laser beam LS is 
therefore not realistic irrespective of whether access is 
continuous or non-continuous, in addition, a multimedia 
bitstream MBS is separately recorded to the recording 
surfaces on the first and second sides of the disk. 
[01 09] A different example of the double-sided single 
layer disk RC3 shown in Fig. 8 is shown in Fig. 14 as 
disk RC3a. Note that this disk comprises clockwise re- 
cording tracks TRA as shown in Fig. 9 on both recording 
surfaces RS1 and RS2. As with the preceding recording 
media, the outside end points OA and OA of the record- 
ing tracks on each recording surface are preferably po- 
sitioned at the same radial position relative to the center 
axis of the DVD recording medium RC3a. Unlike the 



double-sided dual layer disk with symmetrical track 
paths RC3s described above, the tracks on these re- 
cording surfaces RS1 and RS2 are asymmetrical. This 
type of disk is therefore known as a double-sided dual 

5 layer disk with asymmetrical track paths. This double- 
sided dual layer disk with asymmetrical track paths 
RC3a rotates in direction RdA when reading/writing the 
first recording surface RS1. As a result, the track path 
on the second recording surface RS2 on the opposite 

10 side is opposite the direction DrA in which the track 
winds, i.e., direction DrB. 

[0110] This means that if a laser beam LS is driven 
continuously from the inside circumference to the out- 
side circumference on the first recording surface RS1 , 

15 and then from the outside circumference to the inside 
circumference on the second recording surface RS2, 
both sides of the recording medium RC3a can be read/ 
written without turning the disk over and without provid- 
ing different laser beams for the two sides. 

20 [0111] The track paths for recording surfaces RS1 
and RS2 are also the same with this double-sided dual 
layer disk with asymmetrical track paths RC3a. As a re- 
sult, it is also possible to read/write both sides of the disk 
without providing separate laser beams for each side if 

25 the recording medium RC3a is turned over between 
sides, and the read/write apparatus can therefore be 
constructed economically. 

[0112] It should be noted that this recording medium 
remains functionally identical even if counterclockwise 
30 recording track TRB is provided in place of clockwise 
recording track TRA on both recording surfaces RS1 
and RS2. 

[0113] As described above, the true value of a DVD 
system whereby the storage capacity of the recording 

35 medium can be easily increased by using a multiple lay- 
er recording surface is realized in multimedia applica- 
tions whereby plural video data units, plural audio data 
units, and plural graphics data units recorded to a single 
disk are reproduced through interactive operation by the 

^o user. 

[0114] It is therefore possible to achieve one long- 
standing desire of software (programming) providers, 
specifically, to provide programming content such as a 
commercial movie on a single recording medium in plu- 
45 ral versions for different language and demographic 
groups while retaining the image quality of the original. 

Parental control 

50 [0115] Content providers of movie and video titles 
have conventionally had to produce, supply, and man- 
age the inventory of individual titles in multiple languag- 
es, typically the language of each distribution market, 
and multi-rated title packages conforming to the paren- 

55 tal control (censorship) regulations of individual coun- 
tries in Europe and North America. The time and re- 
sources required for this are significant. While high im- 
age quality is obviously important, the programming 
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content must also be consistently reproducible. 
[0116] The digital video disk recording medium is 
close to solving these problems. 

Multiple angles 

[01 17] One interactive operation widely sought in mul- 
timedia applications today is for the user to be able to 
change the position from which a scene is viewed during 
reproduction of that scene. This capability is achieved 
by means of the multiple angle function. 
[0113] This multiple angle function makes possible 
applications whereby, for example, a user can watch a 
baseball game from different angles (or virtual positions 
in the stadium), and can freely switch between the views 
while viewing is in progress. In this example of a base- 
ball game, the available angles may include a position 
behind the backstop centered on the catcher, batter, and 
pitcher; one from behind the backstop centered on a 
fielder, the pitcher, and the catcher; and one from center 
field showing the view to the pitcher and catcher. 
[0119] To meet these requirements, the digital video 
disk system uses MPEG, the same basic standard for- 
mat used with Video-Cds to record the video, audio, 
graphics, and other signal data. Because of the differ- 
ences in storage capacity, transfer rates, and signal 
processing performance within the reproduction appa- 
ratus, DVD uses MPEG2, the compression method and 
data format of which differ slightly from the MPEG1 for- 
mat used with Video-Cds. 

[0120] It should be noted that the content of and dif- 
ferences between the MPEG1 and MPEG2 standards 
have no direct relationship to the intent of the present 
invention, and further description is therefore omitted 
below (for more information, see MPEG specifications 
ISO-11172 and ISO-13818). 

[0121] The data structure of the DVD system accord- 
ing to the present invention is described in detail below 
with reference to Figs. 1 6, 1 7, 1 8, 1 9, 20, and 21 . 

Multi-scene control 

[0122] A fully functional and practical parental lock 
playback function and multi-angle scene playback func- 
tion must enable the user to modify the system output 
in minor, subtle ways while still presenting substantially 
the same video and audio output. If these functions are 
achieved by preparing and recording separate titles sat- 
isfying each of the many possible parental lock and mul- 
ti-angle scene playback requests, titles that are sub- 
stantially identical and differ in only minor ways must be 
recorded to the recording medium. This results in iden- 
tical data being repeatedly recorded to the larger part of 
the recording medium, and significantly reduces the uti- 
lization efficiency of the available storage capacity. More 
particularly, it is virtually impossible to record discrete 
titles satisfying every possible request even using the 
massive capacity of the digital video disk medium. While 



it may be concluded that this problem can be easily 
solved by increasing the capacity of the recording me- 
dium, this is an obviously undesirable solution when the 
effective use of available system resources is consid- 
ered. 

[0123] Using multi-scene control, the concept of 
which is described in another section below, in a DVD 
system, it is possible to dynamically construct titles for 
numerous variations of the same basic content using the 
smallest possible amount of data, and thereby effective- 
ly utilize the available system resources (recording me- 
dium). More specifically, titles that can be played back 
with numerous variations are constructed from basic 
(common) scene periods containing data common to 
each title, and multi-scene periods comprising groups 
of different scenes corresponding to the various re- 
quests. During reproduction, the user is able to freely 
and at any time select particular scenes from the multi- 
scene periods to, dynamically construct a title conform- 
ing to the desired content, e.g., a title omitting certain 
scenes using the parental lock control function. 
[01 24] Note that multi-scene control enabling a paren- 
tal lock playback control function and multi-angle scene 
playback is described in another section below with ref- 
erence to Fig. 21 . 

Data structure of the DVD system 

[0125] The data structure used in the authoring sys- 
tem of a digital video disk system according to the 
present invention is shown in Fig. 22. To record a mul- 
timedia bitstream MBS, this digital video disk system di- 
vides the recording medium into three major recording 
areas, the lead-in area LI, the volume space VS, and 
the lead-out area LO. 

[0126] The lead-in area LI is provided at the inside cir- 
cumference area of the optical disk. In the disks de- 
scribed with reference to Figs. 9 and 1 0, the lead-in area 
LI is positioned atthe inside end points IA and IB of each 
track. Data for stabilizing the operation of the reproduc- 
ing apparatus when reading starts is written to the lead- 
in area LI. 

[0127] The lead-out area LO is correspondingly locat- 
ed at the outside circumference of the optical disk, i.e., 
at outside end points OA and OB of each track in the 
disks described with reference to Figs. 9 and 10. Data 
identifying the end of the volume space VS is recorded 
in this lead-out area LO. 

[0128] The volume space VS is located between the 
lead-in area LI and lead-out area LO, and is recorded 
as a one-dimensional array of n+1 (where n is an integer 
greater than or equal to zero) 2048-byte logic sectors 
LS. The logic sectors LS are sequentially number #0, 
#1 , #2, ... #n. The volume space VS is also divided into 
a volume and file structure management area VFS and 
a file data structure area FDS. 

[01 29] The volume and file structure management ar- 
ea VFS comprises m+1 logic sectors LS#0 to LS#m 
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(where m is an integer greater than or equal to zero and 
less than n. The file data structure FDS comprises n-m 
logic sectors LS #m+1 to LS #n. 
[0130] Note that this file data structure area FDS cor- 
responds to the multimedia bitstream MBS shown in Fig. 
1 and described above. 

[0131] The volume file structure VFS is the file system 
for managing the data stored to the volume space VS 
as files, and is divided into logic sectors LS#0 - LS#m 
where m is the number of sectors required to store all 
data needed to manage the entire disk, and is a natural 
number less than n. Information for the files stored to 
the file data structure area FDS is written to the volume 
file structure VFS according to a known specification 
such as ISO-9660 or ISO-13346. 
[0132] The file data structure area FDS comprises n- 
m logic sectors LS#m - LS#n, each comprising a video 
manager VMG sized to an integer multiple of the logic 
sector (2048 x I, where I is a known integer), and k video 
title sets VTS #1 - VTS#k (where k is a natural number 
less than 100). 

[01 33] The video manager VMG stores the title man- 
agement information for the entire disk, and information 
for building a volume menu used to set and change re- 
production control of the entire volume. 
[01 34] Any video title set VTS #k is also called a "vid- 
eo file" representing a title comprising video, audio, and/ 
or still image data. 

[0135] The internal structure of each video title set 
VTS shown in Fig. 22 is shown in Fig. 16. Each video 
title set VTS comprises VTS information VTSI describ- 
ing the management information for the entire disk, and 
the VTS title video objects VOB (VTSTT_VOBS), i.e., 
the system stream of the multimedia bitstream. The VTS 
information VTSI is described first below, followed by the 
VTS title VOBS. 

[01 36] The VTS information primarily includes the VT- 
SI management table VTSI_MAT and VTSPGC infor- 
mation table VTS_PGClT. 

[0137] The VTSI management table VTSI_MAT 
stores such information as the internal structure of the 
video title set VTS, the number of selectable audio 
streams contained in the video title set VTS, the number 
of sub-pictures, and the video title set VTS location (stor- 
age address). 

[0138] The VTSPGC information table VTS_PGCIT 
records i (where i is a natural number) program chain 
(PGC) data blocks VTS_PGCI #1 - VTS_PGCI #i for 
controlling the playback sequence. Each of the table en- 
tries VTS_PGCI #i is a data entry expressing the pro- 
gram chain, and comprises j (where j is a natural 
number) cell playback information blocks C_PBI #1 - 
C_PBI #j. Each cell playback information block C_PBI 
#j contains the playback sequence of the cell and play- 
back control information. 

[01 39] The program chain PGC is a conceptual struc- 
ture describing the story of the title content, and there- 
fore defines the structure of each title by describing the 



cell playback sequence. Note that these cells are de- 
scribed in detail below. 

[0140] If, for example, the video title set information 
relates to the menus, the video title set information VTSI 

5 is stored to a buffer in the playback device when play- 
back starts. If the user then presses a MENU button on 
a remote control device, for example, during playback, 
the playback device references the buffer to fetch the 
menu information and display the top menu #1 . If the 

10 menus are hierarchical, the main menu stored as pro- 
gram chain information VTS_PGCl #1 may be dis- 
played, for example, by pressing the MENU button, 
VTS_PGCI #2 - #9 may correspond to submenus ac- 
cessed using the numeric keypad on the remote control, 

15 and VTS_PGCI #1 0 and higher may correspond to ad- 
ditional submenus further down the hierarchy. Alterna- 
tively, VTS_PGCI #1 may be the top menu displayed by 
pressing the MENU button, while VTS_PGCI #2 and 
higher may be voice guidance reproduced by pressing 

20 the corresponding numeric key. 

[0141] The menus themselves are expressed by the 
plural program chains defined in this table. As a result, 
the menus may be freely constructed in various ways, 
and shall not be limited to hierarchical or non-hierarchi- 

25 cal menus or menus containing voice guidance. 

[0142] In the case of a movie, for example, the video 
title set information VTSI is stored to a buffer in the play- 
back device when playback starts, the playback device 
references the cell playback sequence described by the 

30 program chain PGC, and reproduces the system 
stream. 

[0143] The "cells" referenced here may be all or part 
of the system stream, and are used as access points 
during playback. Cells can therefore be used, for exam- 

35 pie, as the "chapters" into which a title may be divided. 
[0144] Note that each of the PGC information entries 
C_PBI #j contain both cell playback processing informa- 
tion and a cell information table. The cell playback 
processing information comprises the processing infor- 

40 mation needed to reproduce the cell, such as the pres- 
entation time and number of repetitions. More specifi- 
cally, this information includes the cell block mode CBM, 
cell block type CBT, seamless playback flag SPF, inter- 
leaved allocation flag IAF, STC resetting flag STCDF, 

45 cell presentation time C_PBTM, seamless angle 
change flag SACF, first cell VOBU start address 
C_FVOBU_SA, and the last cell VOBU start address 
CJ_VOBU_SA. 

[0145] Note that seamless playback refers to the re- 
50 production in a digital video disk system of multimedia 
data including video, audio, and sub-picture data with- 
out intermittent breaks in the data or information . Seam- 
less playback is described in detail in another section 
below with reference to Fig. 23 and Fig. 24. 
55 [0146] The cell block mode CBM indicates whether 
plural cells constitute one functional block. The cell play- 
back information of each cell in a functional block is ar- 
ranged consecutively in the PGC information. The cell 
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block mode CBM of the first cell playback information in 
this sequence contains the value of the first cell in the 
block, and the cell block mode CBM of the last cell play- 
back information in this sequence contains the value of 
the last cell in the block. The cell block mode CBM of 
each cell arrayed between these first and last cells con- 
tains a value indicating that the cell is a cell between 
these first and last cells in that block. 
[0147] The cell block type CBT identifies the type of 
the block indicated by the cell block mode CBM. For ex- 
ample, when a multiple angle function is enabled, the 
cell information corresponding to each of the reproduc- 
ible angles is programmed as one of the functional 
blocks mentioned above, and the type of these function- 
al blocks is defined by a value identifying "angle" in the 
cell block type CBT for each cell in that block. 
[0148] The seamless playback flag SPF simply indi- 
cates whether the corresponding cell is to be linked and 
played back seamlessly with the cell or cell block repro- 
duced immediately therebefore. To seamlessly repro- 
duce a given cell with the preceding cell or cell block, 
the seamless playback flag SPF is set to 1 in the cell 
playback information for that cell; otherwise SPF is set 
too. 

[0149] The interleaved allocation flag IAF scores a 
value identifying whether the cell exists in a contiguous 
or interleaved block. If the cell is part of an interleaved 
block, the flag IAF is set to 1 ; otherwise it is set to 0. 
[0150] The STC resetting flag STCDF identifies 
whether the system time clock STC used for synchroni- 
zation must be reset when the cell is played back; when 
resetting the system time clock STC is necessary, the 
STC resetting flag STCDF is set to 1 . 
[01 51] The seamless angle change flag SACF stores 
a value indicating whether a cell in a multi-angle period 
should be connected seamlessly at an angle change. If 
the angle change is seamless, the seamless angle 
change flag SACF is set to 1 ; otherwise it is set to 0. 
[0152] The cell presentation time C_PBTM expresses 
the cell presentation time with video frame precision. 
[0153] The first cell VOBU start address 
C_FVOBlLSA is the VOBU start address of the first cell 
in a block, and is also expressed as the distance from 
the logic sector of the first cell in the VTS title VOBS 
(VTSTT_VOBS) as measured by the number of sectors. 
[0154] The last cell VOBU start address 
C_LVOBU_SA is the VOBU start address of the last cell 
in the block. The value of this address is expressed as 
the distance from the logic sector of the first cell in the 
VTS title VOBS (VTSTT_VOBS) as measured by the 
number of sectors. 

[0155] The VTS title VOBS (VTSTT_VOBS), i.e., the 
multimedia system stream data, is described next. The 
system stream data VTSTT_VOBS comprises i (where 
i is a natural number) system screams SS, each of which 
is referred to as a "video object" (VOB). Each video ob- 
ject VOB #1 - VOB #i comprises at least one video data 
block interleaved with up to a maximum eight audio data 



blocks and up to a maximum 32 sub-picture data blocks. 
[0156] Each video object VOB comprises q (where q 
is a natural number) cells C#1 - C#q. Each cell C com- 
prises r (where r is a natural number) video object units 

5 VOBU #1 -VOBU #r. 

[01 57] Each video object unit VOBU comprises plural 
groups_of_pictures GOP, and the audio and sub-pic- 
tures corresponding to the playback of said plural 
groups_of_pictures GOP. Note that the 

10 group_of_pictures GOP corresponds to the video en- 
coding refresh cycle. Each video object unit VOBU also 
starts with an NVpack, i.e., the control data for that VO- 
BU. 

[01 58] The structure of the navigation packs N V is de- 

15 scribed with reference to Fig. 19. 

[0159] Before describing the navigation pack NV, the 
internal structure of the video zone VZ (see Fig. 22), i. 
e., the system scream St35 encoded by the authoring 
encoder EC described with reference to Fig. 25, is de- 

20 scribed with reference to Fig. 17._Note that the encoded 
video stream St15 shown in Fig. 17 is the compressed 
one-dimensional video data stream encoded by the vid- 
eo encoder 3C0. The encoded audio stream St1 9 is like- 
wise the compressed one-dimensional audio data 

25 stream multiplexing the right and left stereo audio chan- 
nels encoded by the audio encoder 700. Note that the 
audio signal shall not be limited to a stereo signal, and 
may also be a multichannel surround-sound signal. 
[0160] The system stream (title editing unit VOB) St35 

30 is a one dimensional array of packs with a byte size cor- 
responding to the logic sectors LS #n having a 
2048-byte capacity as described using Fig. 22. A stream 
control pack is placed at the beginning of the title editing 
unit (VOB) St35, i.e., at the beginning of the video object 

35 unit VOBU. This stream control pack is called the "nav- 
igation pack NV", and records the data arrangement in 
the system stream and other control information. 
[0161] The encoded video stream St15 and the en- 
coded audio stream St19 are packetized in byte units 

40 corresponding to the system stream packs. These pack- 
ets are shown in Fig. 17 as packets V1, V2, V3, V4... 
and A1, A2, A3.... As shown in Fig. 17, these packets 
are interleaved in the appropriate sequence as system 
stream St35, thus forming a packer stream, with consid- 

45 eration given to the decoder buffer size and the time re- 
quired by the decoder to expand the video and audio 
data packets. In the example shown in Fig. 1 7, the pack- 
et stream is interleaved in the sequence V1 , V2, A1 , V3, 
V4, A2.... 

50 [01 62] Note that the sequence shown in Fig. 1 7 inter- 
leaves one video data unit with one audio data unit. Sig- 
nificantly increased recording/playback capacity, high 
speed recording/playback, and performance improve- 
ments in the signal processing LSI enable the DVD sys- 

55 tern to record plural audio data and plural sub-picture 
data (graphics data) to one video data unit in a single 
interleaved MPEG system stream, and thereby enable 
the user to select the specific audio data and sub-picture 
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data to be reproduced during playback. The structure of 
the system stream used in this type of DVD system is 
shown in Fig. 18 and described below. 
[0163] As in Fig. 17, the packetized encoded video 
stream St15 is shown in Fig. 1 8 as V1 , V2, V3, V4, ... In 
this example, however, there is not just one encoded 
audio stream St19, but three encoded audio streams 
St19A, St19B, and St19C input as the source data. 
There are also two encoded sub-picture streams St1 7A 
and St1 7B input as the source data sub-picture streams. 
These six compressed data streams, St15, St19A, 
St19B, St19C, St17A and St17B, are interleaved to a 
single system stream St35. 

[0164] The video data is encoded according to the 
MPEG specification with the group_of_pictures GOP 
being the unit of compression. In general, each 
group_of_pictures GOP contains 15 frames in the case 
of an NTSC signal, but the specific number of frames 
compressed to one GOP is variable. The stream man- 
agement pack, which describes the management data 
containing, for example, the relationship between inter- 
leaved data, is also interleaved at the GOP unit interval. 
Because the group_of_pictures GOP unit is based on 
the video data, changing the number of video frames 
per GOP unit changes the interval of the stream man- 
agement packs. This interval is expressed in terms of 
the presentation time on the digital video disk within a 
range from 0.4 sec. to 1 .0 sec. referenced to the GOP 
unit. If the presentation time of contiguous plural GOP 
units is less than 1 sec, the management data packs 
for the video data of the plural GOP units is interleaved 
to a single stream. 

[01 65] These management data packs are referred to 
as navigation packs NV in the digital video disk system. 
The data from one navigation pack NV to the packet im- 
mediately preceding the next navigation pack NV forms 
one video object unit VOBU. In general, one contiguous 
playback unit that can be defined as one scene is called 
a video object VOB, and each video object VOB con- 
tains plural video object units VOBU. Data sets of plural 
video objects VOB form a VOB set (VOBS). Note that 
these data units were first used in the digital video disk. 
[0166] When plural of these data streams are inter- 
leaved, the navigation packs NV defining the relation- 
ship between the interleaved packs must also be inter- 
leaved at a defined unit known as the pack number unit. 
Each group_of_pictures GOP is normally a unit contain- 
ing approximately 0. 5 sec. of video data, which is equiv- 
alent to the presentation time required for 12 - 15 
frames, and one navigation pack NV is generally inter- 
leaved with the number of data packets required for this 
presentation time. 

[0167] The stream management information con- 
tained in the interleaved video, audio, and sub-picture 
data packets constituting the system stream is de- 
scribed below with reference to Fig. 1 9 As shown in Fig. 
1 9, the data contained in the system stream is recorded 
in a format packed or packetized according to the 



MPEG2 standard. The packet structure is essentially 
the same for video, audio, and sub-picture data. One 
pack in the digital video disk system has a 2048 byte 
capacity as described above, and contains a pack head- 

5 er PKH and one packet PES; each packet PES contains 
a packet header PTH and data block. 
[0168] The pack header PKH records the time at 
which that pack is to be sent from stream buffer 2400 to 
system decoder 2500 (see Fig. 26), i.e., the system 

10 clock reference SCR defining the reference time for syn- 
chronized audio-visual data playback. The MPEG 
standard assumes that the system clock reference SCR 
is the reference clock for the entire decoder operation. 
With such disk media as the digital video disk, however, 

is time management specific to individual disk players can 
be used, and a reference clock for the decoder system 
is therefore separately provided. 
[0169] The packet header PTH similarly contains a 
presentation time stamp PTS and a decoding time 

20 stamp DTS, both of which are placed in the packet be- 
fore the access unit (the decoding unit). The presenta- 
tion time stamp PTS defines the time at which the video 
data or audio data contained in the packet should be 
output as the playback output after being decoded, and 

25 the decoding time stamp DTS defines the time at which 
the video stream should be decoded. Note that the pres- 
entation time stamp PTS effectively defines the display 
start timing of the access unit, and the decoding time 
stamp DTS effectively defines the decoding start timing 

30 of the access unit. If the PTS and DTS are the same 
time, the DTS is omitted. 

[0170] The packet header PTH also contains an 8-bit 
field called the stream ID identifying the packet type, i. 
e., whether the packet is a video packet containing a 
35 video data stream, a private packet, or an MPEG audio 
packet. 

[0171] Private packets under the MPEG2 standard 
are data packets of which the content can be freely de- 
fined. Private packet 1 in this embodiment of the inven- 

40 tion is used to carry audio data other than the MPEG 
audio data, and sub-picture data; private packet 2 car- 
ries the PCI packet and DSI packet. 
[0172] Private packets 1 and 2 each comprise a pack- 
et header, private data area, and data area. The private 

45 data area contains an 8-bit sub-stream ID indicating 
whether the recorded data is audio data or sub-picture 
data. The audio data defined by private packet 2 may 
be defined as any of eight types #0 - #7 of linear PCM 
or AC-3 encoded data. Subpicture data may be defined 

50 as one of up to 32 types #0 - #31 . 

[0173] The data area is the field to which data com- 
pressed according to the MPEG2 specification is written 
if the stored data is video data; linear PCM, AC-3, or 
MPEG encoded data is written if audio data is stored; 

55 or graphics data compressed by runlength coding is 
written if sub-picture data is stored. 
[0174] MPEG2-compressed video data may be com- 
pressed by constant bit rate (CBR) or variable bit rate 
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(VBR) coding. With constant bit rate coding, the video 
stream is input continuously to the video buffer at a con- 
stant rate. This contrasts with variable bit rate coding in 
which the video stream is input intermittently to the video 
buffer, thereby making it possible to suppress the gen- 
eration of unnecessary code. Both constant bit rate and 
variable bit rate coding can be used in the digital video 
disk system. 

[0175] Because MPEG video data is compressed with 
variable length coding, the data quantity in each 
group_of_pictures GOP is not constant. The video and 
audio decoding times also differ, and the time-base re- 
lationship between the video and audio data read from 
an optical disk, and the time-base relationship between 
the video and audio data output from the decoder, do 
not match. The method of time-base synchronizing the 
video and audio data is therefore described in detail be- 
low with reference to Fig. 26, but is described briefly be- 
low based on constant bit rate coding. 
[0176] The navigation pack NV structure is shown in 
Fig. 20. Each navigation pack NV starts with a pack 
header PKH, and contains a PCI packet and DSI packet. 
[0177] As described above, the pack header PKH 
records the time at which that pack is to be sent from 
stream buffer 2400 to system decoder 2500 (see Fig. 
26 ), i.e., the system clock reference SCR defining the 
reference time for synchronized audio-visual data play- 
back. 

[0178] Each PCI packet contains PCI General Infor- 
mation (PCI_Gl) and Angle Information for Non-seam- 
less playback (NMSL_AGLI). 

[0179] The PCI General Information (PCLGI) de- 
clares the display time of the first video frame (the Start 
PTM of VOBU (VOBU_S_PTM)), and the display time 
of the last video frame (End PTM of VOBU 
(VOBUJE_PTM)), in the corresponding video object 
unit VOBU with system clock precision (90 Khz). 
[0180] The Angle Information for Non-seamless play- 
back (NMSL_AGLl) states the read start address of the 
corresponding video object unit VOBU when the angle 
is changed-expressed as the number of sectors from the 
beginning of the video object VOB. Because there are 
nine or fewer angles in this example, there are nine an- 
gle address declaration cells: Destination Address of 
Angle Cell #1 for Non-seamless playback 
( N M S L_ A G L_C 1 _D STA) to Destination Address of An- 
gle Cell #9 for Non-seamless playback 
(NMSL_AGL_C9_DSTA). 

[0181] Each DSI packet contains DSI General Infor- 
mation (DSI_GI), Seamless Playback Information 
(SML_PBI), and Angle Information for Seamless play- 
back (SML_AGLI). 

[0182] The DSI General Information (DSIJ3I) de- 
clares the address of the last pack in the video object 
unit VOBU, i. e., the End Address for VOB (VOBU_EA), 
expressed as the number of sectors from the beginning 
of the video object unit VOBU. 

[01 83] While seamless playback is described in detail 



later, it should be noted that the continuously read data 
units must be interleaved (multiplexed) at the system 
stream level as an interleaved unit ILVU in order to 
seamlessly reproduce split or combined titles. Plural 
5 system streams interleaved with the interleaved unit IL- 
VU as the smallest unit are defined as an interleaved 
block. 

[0184] The Seamless Playback Information 
(SML_PBI) is declared to seamlessly reproduce the 

10 stream interleaved with the interleaved unit ILVU as the 
smallest data unit, and contains an Interleaved Unit Flag 
(ILVU flag) identifying whether the corresponding video 
object unit VOBU is an interleaved block. The ILVU flag 
indicates whether the video object unit VOBU is in an 

15 interleaved block, and is set to 1 when it is. Otherwise 
the ILVU flag is set to 0. 

[0185] When a video object unit VOBU is in an inter- * 
leaved block, a Unit END flag is declared to indicate 
whether the video object unit VOBU is the last VOBU in 

20 the interleaved unit ILVU. Because the interleaved unit 
ILVU is the data unit for continuous reading, the Unit 
END flag is set to 1 if the VOBU currently being read is 
the last VOBU in the interleaved unit ILVU. Otherwise 
the Unit END flag is set to 0. 

25 [0186] An Interleaved Unito End Address (ILVU_EA) 
identifying the address of the last pack in the ILVU to 
which the VOBU belongs, and the starting address of 
the next interleaved unit ILVU, Next Interleaved Unit 
Start Address (NTJLVU_SA), are also declared when 

30 a video object unit VOBU is in an interleaved block. Both 
the Interleaved Unit End Address (ILVU_EA) and Next 
Interleaved Unit Start Address (NT_ILVU_SA) are ex- 
pressed as the number of sectors from the navigation 
pack NV of that VOBU. 

35 [0187] When two system streams are seamlessly 
connected but the audio components of the two system 
streams are not contiguous, particularly immediately be- 
fore and after the seam, it is necessary to pause the au- 
dio output to synchronize the audio and video compo- 

40 nents of the system stream following the seam. Note 
that non-contiguous audio may result from different au- 
dio signals being recording with the corresponding video 
blocks. With an NTSC signal, for example, the video 
frame cycle is approximately 33. 33 msec while the AC- 

45 3 audio frame cycle is 32 msec. 

[0188] To enable this resynchronization, audio repro- 
duction stopping times 1 and 2, i.e., Audio Stop PTM 1 
in VOB (VOB__A_STP_PTM1), and Audio Stop PTM2 in 
VOB (VOB_A_STP_PTM2), indicating the time at which 

50 the audio is to be paused; and audio reproduction stop- 
ping periods 1 and 2, i.e., Audio Gap Length 1 in VOB 
(VOB_AJ3AP_LEN1) and Audio Gap Length 2 in VOB 
(VOB_A_GAP_LEN2), indicating for how long the audio 
is to be paused, are also declared in the DSI packet. 

55 Note that these times are specified at the system clock 
precision (90 Khz). 

[01 89] The Angle Information for Seamless playback 
(SML_AGLI) declares the read start address when the 
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angle is changed. Note that this field is valid when seam- 
less, multi-angle control is enabled. This address is also 
expressed as the number of sectors from the navigation 
pack NV of that VOBU. Because there are nine or fewer 
angles, there are nine angle address declaration cells: 
Destination Address of Angle Cell #1 for Seamless play- 
back (SML_AGL_C1_DSTA) to Destination Address of 
Angle Cell #9 for Seamless playback 
(SML_AGL_C9_DSTA). 

[0190] Note also that each title is edited in video ob- 
ject (VOB) units. Interleaved video objects (interleaved 
title editing units) are referenced as "VOBS"; and the 
encoded range of the source data is the encoding unit. 

DVD encoder 

[0191] A preferred embodiment of a digital video disk 
system authoring encoder ECD in which the multimedia 
bitstream authoring system according to the present in- 
vention is applied to a digital video disk system is de- 
scribed below and shown in Fig. 25. It will be obvious 
that the authoring encoder ECD applied to the digital vid- 
eo disk system, referred to below as a DVD encoder, is 
substantially identical to the authoring encoder EC 
shown in Fig. 2. The basic difference between these en- 
coders is the replacement in the DVD encoder ECD of 
the video zone formatter 1300 of the authoring encoder 
EC above with a VOB buffer 1000 and formatter 1100. 
it will also be obvious that the bitstream encoded by this 
DVD encoder ECD is recorded to a digital video disk 
medium M. The operation of this DVD encoder ECD is 
therefore described below in comparison with the au- 
thoring encoder EC described above. 
[0192] As in the above authoring encoder EC, the en- 
coding system controller 200 generates control signals 
St9, St11 , St13, St21 , St23, St25, St33, and St39 based 
on the scenario data St7 describing the user-defined ed- 
iting instructions input from the scenario editor 1 00, and 
controls the video encoder 300, sub-picture encoder 
500, and audio encoder 700 in the DVD encoder ECD. 
Note that the user-defined editing instructions in the 
DVD encoder ECD are a superset of the editing instruc- 
tions of the authoring encoder EC described above. 
[0193] Specifically, the user-defined editing instruc- 
tions (scenario data St7) in the DVD encoder ECD sim- 
ilarly describe what source data is selected from all or 
a subset of the source data containing plural titles within 
a defined time period, and how the selected source data 
is reassembled to reproduce the scenario (sequence) 
intended by the user. The scenario data St7 of the DVD 
encoder ECD, however, further contains such informa- 
tion as: the number of streams contained in the editing 
units, which are obtained by splitting a multi-title source 
stream into blocks at a constant time interval; the 
number of audio and sub-picture data cells contained in 
each stream, and the sub-picture display time and peri- 
od; whether the title is a multi-rated title enabling paren- 
tal lock control; whether the user content is selected 



from plural streams including, for example, multiple 
viewing angles; and the method of connecting scenes 
when the angle is switched among the multiple viewing 
angles. 

5 [0194] The scenario data St7 of the DVD encoder 
ECD also contains control information on a video object 
VOB unit basis. This information is required to encode 
the media source stream, and specifically includes such 
information as whether there are multiple angles orpa- 

10 rental control features. When multiple angle viewing is 
enabled, the scenario data St7 also contains the encod- 
ing bit rate of each stream considering data interleaving 
and the disk capacity, the start and end times of each 
control, and whether a seamless connection should be 

15 made between the preceding and following streams. 
[0195] The encoding system controller 200 extracts 
this information from the scenario data St7, and gener- 
ates the encoding information table and encoding pa- 
rameters required for encoding control. The encoding 

20 information table and encoding parameters are de- 
scribed with reference to Figs. 27, 28, and 29 below. 
[0196] The stream encoding data St33 contains the 
system stream encoding parameters and system en- 
coding start and end timing values required by the DVD 

25 system to generate the VOBs. These system stream en- 
coding parameters include the conditions for connecting 
one video object VOB with those before and after, the 
number of audio streams, the audio encoding informa- 
tion and audio Ids, the number of sub-pictures and the 

30 sub-picture Ids, the video playback starting time infor- 
mation VPTS, and the audio playback starting time in- 
formation APTS. 

[0197] The title sequence control signal St39 supplies 
the multimedia bitstream MBS formatting start and end 
35 timing information and formatting parameters declaring 
the reproduction control information and interleave in- 
formation. 

[0198] Based on the video encoding parameter and 
encoding start/end timing signal St9, the video encoder 

40 300 encodes a specific part of the video stream St1 to 
generate an elementary stream conforming to the 
MPEG2 Video standard defined in ISO-13818. This el- 
ementary stream is output to the video stream buff er 400 
as encoded video stream St15. 

45 [0199] Note that while the video encoder 300 gener- 
ates an elementary stream conforming to the MPEG2 
Video standard defined in ISO-1 381 8, specific encoding 
parameters are input via the video encoding parameter 
signal St9, including the encoding start and end timing, 

so bit rate, the encoding conditions for the encoding start 
and end, the material type, including whether the mate- 
rial is an NTSC or PAL video signal or telecine converted 
material, and whether the encoding mode is set for ei- 
ther open GOP or closed GOP encoding. 

55 [0200] The MPEG2 coding method is basically an in- 
terframe coding method using the correlation between 
frames for maximum signal compression, i.e., the frame 
being coded (the target frame) is coded by referencing 



35 



EP 0 920 203 B1 



36 



frames before and/or after the target frame. However, 
intra-coded frames, i. e. , frames that are coded based 
solely on the content of the target frame, are also insert- 
ed to avoid error propagation and enable accessibility 
from mid-stream (random access) . The coding unit con- 
taining at least one intra-coded frame ("intra-frame") is 
called a group_of_pictures GOP. 
[0201] A group_of_pictures GOP in which coding is 
closed completely within that GO P is known as a "closed 
GOP. " A group_of_pictures GOP containing a frame 
coded with reference to a frame in a preceding or fol- 
lowing (ISO-13818 DOES NOT LIMIT P- and B-picture 
CODING to referencing PAST frames) 
group_of_pictures GOP is an "open GOP." It is therefore 
possible to playback a closed GOP using only that GOP. 
Reproducing an open GOP, however, also requires the 
presence of the referenced GOP, generally the GOP 
preceding the open GOP. 

[0202] The GOP is often used as the access unit. For 
example, the GOP may be used as the playback start 
point for reproducing a title from the middle, as a tran- 
sition point in a movie, or for fast-forward play and other 
special reproduction modes. High speed reproduction 
can be achieved in such cases by reproducing only the 
intra-frame coded frames in a GOP or by reproducing 
only frames in GOP units. 

[0203] Based on the sub-picture stream encoding pa- 
rameter signal St11, the sub-picture encoder 500 en- 
codes a specific part of the sub-picture stream St3 to 
generate a variable length coded bitstream of bit- 
mapped data. This variable length coded bitstream data 
is output as the encoded sub-picture stream St1 7 to the 
sub-picture stream buffer 600. 

[0204] Based on the audio encoding parameter signal 
St13, the audio encoder 700 encodes a specific part of 
the audio stream St5 to generate the encoded audio da- 
ta. This encoded audio data may be data based on the 
MPEG1 audio standard defined in ISO-11T72 and the 
MPEG2 audio standard defined in ISO-13818, AC-3 au- 
dio data, or PCM (LPCM) data. Note that the methods 
and means of encoding audio data according to these 
standards are known and commonly available. 
[0205] The video stream buffer 400 is connected to 
the video encoder 300 and to the encoding system con- 
troller 200. The video stream buffer 400 stores the en- 
coded video stream St15 input from the video encoder 
300, and outputs the stored encoded video stream St15 
as the time-delayed encoded video stream St27 based 
on the timing signal St21 supplied from the encoding 
system controller 200. 

[0206] The sub-picture stream buffer 600 is similarly 
connected to the sub-picture encoder 500 and to the en- 
coding system controller 200. The sub-picture stream 
buffer 600 stores the encoded sub-picture stream St17 
input from the sub-picture encoder 500, and then out- 
puts the stored encoded sub-picture stream St17 as 
time-delayed encoded sub-picture stream St29 based 
on the timing signal St23 supplied from the encoding 



system controller 200. 

[0207] The audio stream buffer 800 is similarly con- 
nected to the audio encoder 700 and to the encoding 
system controller 200. The audio stream buffer 800 

5 stores the encoded audio stream St19 input from the 
audio encoder 700, and then outputs the encoded audio 
stream St1 9 as the time-delayed encoded audio stream 
St31 based on the timing signal St25 supplied from the 
encoding system controller 200. 

10 [0208] The system encoder 900 is connected to the 
video stream buffer 400, sub-picture stream buffer 600, 
audio stream buffer 800, and the encoding system con- 
troller 200, and is respectively supplied thereby with the 
time-delayed encoded video stream St27, time-delayed 

15 encoded sub-picture stream St29, time-delayed encod- 
ed audio stream St31 , and the system stream encoding 
parameter data St33. Note that the system encoder 900 
is a multiplexer that multiplexes the time-delayed 
streams St27, St29, and St31 based on the stream en- 

20 coding data St33 (timing signal) to generate title editing 
units (VOBs) St35. 

[0209] The VOB buffer 1000 temporarily stores the 
video objects VOBs produced by the system encoder 
900. The formatter 1 1 00 reads the delayed video objects 

25 VOB from the VOB buffer 1000 based on the title se- 
quence control signal St39 to generate one video zone 
VZ, and adds the volume file structure VFS to generate 
the edited multimedia stream data St43. 
[0210] The multimedia bitstream MBS St43 edited ac- 

30 cording to the user-defined scenario is then sent to the 
recorder 1 200. The recorder 1 200 processes the edited 
multimedia stream data St43 to the data stream St45 
format of the recording medium M, and thus records the 
formatted data stream St45 to the recording medium M. 

35 

DVD decoder 

[021 1 ] A preferred embodiment of a digital video disk 
system authoring decoder DCD in which the multimedia 

40 bitstream authoring system of the present invention is 
applied to a digital video disk system is described below 
and shown in Fig. 26. The authoring decoder DCD ap- 
plied to the digital video disk system, referred to below 
as a DVD decoder DCD, decodes the multimedia bit- 

45 stream MBS edited using the DVD encoder ECD of the 
present invention, and recreates the content of each title 
according to the user-defined scenario. It will also be 
obvious that the multimedia bitstream St45 encoded by 
this DVD encoder ECD is recorded to a digital video disk 

50 medium M. 

[0212] The basic configuration of the DVD decoder 
DCD according to this embodiment is the same as that 
of the authoring decoder DC shown in Fig. 3. The differ- 
ences are that a different video decoder 3801 (shown 

55 as 3800 in Fig. 26) is used in place of the video decoder 
3800, and a reordering buffer 3300 and selector 3400 
are disposed between the video decoder 3801 and syn- 
thesizer 3500. 
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[021 3] Note that the selector 3400 is connected to the 
synchronizer 2900, and is controlled by a switching sig- 
nal St103. 

[0214] The operation of this DVD decoder DCD is 
therefore described below in comparison with the au- 
thoring decoder DC described above. 
[0215] As shown in Fig. 26, the DVD decoder DCD 
comprises a multimedia bitstream producer 2000, sce- 
nario selector 2100, decoding system controller 2300, 
stream buffer 2400, system decoder 2500, video buffer 
2600, sub-picture buffer 2700, audio buffer 2800, syn- 
chronizer 2900, video decoder 3801 , reordering buffer 
3300, sub-picture decoder 3100, audio decoder 3200, 
selector 3400, synthesizer 3500, video data output ter- 
minal 3600, and audio data output terminal 3700. 
[0216] The bitstream producer 2000 comprises a re- 
cording media drive unit 2004 for driving the recording 
medium M; a reading head 2006 for reading the infor- 
mation recorded to the recording medium M and pro- 
ducing the binary read signal St57; a signal processor 
2008 for variously processing the read signal St57 to 
generate the reproduced bitstream St61 ; and a repro- 
duction controller 2002. 

[0217] The reproduction controller 2002 is connected 
to the decoding system controller 2300 from which the 
multimedia bitstream reproduction control signal St53 is 
supplied, and in turn generates the reproduction control 
signals St55 and St59 respectively controlling the re- 
cording media drive unit (motor) 2004 and signal proc- 
essor 2008. 

[0218] So that the user-defined video, sub-picture, 
and audio portions of the multimedia title edited by the 
authoring encoder EC are reproduced, the authoring de- 
coder DC comprises a scenario selector 21 00 for select- 
ing and reproducing the corresponding scenes (titles). 
The scenario selector 2100 then outputs the selected 
titles as scenario data to the DVD decoder DCD. 
[0219] The scenario selector 2100 preferably com- 
prises a keyboard, CPU, and monitor. Using the key- 
board, the user then inputs the desired scenario based 
on the content of the scenario input by the DVD encoder 
ECD. Based on the keyboard input, the CPU generates 
the scenario selection data St51 specifying the selected 
scenario. The scenario selector 2100 is connected to 
the decoding system controller 2300 by an infrared com- 
munications device, for example, and inputs the gener- 
ated scenario selection data St51 to the decoding sys- 
tem controller 2300. 

[0220] The stream buffer 2400 has a specific buffer 
capacity used to temporarily store the reproduced bit- 
stream St61 input from the bitstream producer 2000, ex- 
tract the volume file structure VFS, the initial synchroni- 
zation data SCR (system clock reference) in each pack, 
and the VOBU control information (DSI) in the naviga- 
tion pack NV, to generate the bitstream control data 
St63. The stream buffer 2400 is also connected to the 
decoding system controller 2300 , to which it supplies the 
generated bitstream control data St63. 



[0221] Based on the scenario selection data St51 
supplied by the scenario selector 2100, the decoding 
system controller 2300 then generates the bitstream re- 
production control signal St53 controlling the operation 

5 of the bitstream producer 2000. The decoding system 
controller 2300 also extracts the user-defined playback 
instruction data from the bitstream reproduction control 
signal St53, and generates the decoding information ta- 
ble required for decoding control. This decoding infor- 

10 mation table is described further below with reference 
to Figs. 26 and 32. The decoding system controller 2300 
also extracts the title information recorded to the optical 
disk M from the file data structure area FDS of the bit- 
stream control data St63 to generate the title information 

15 signal St200. Note that the extracted title information in- 
cludes the video manager VMG, VTS information VTSI, 
the PGC information entries C_PBI #j, and the cell pres- 
entation time C_PBTM. 

[0222] Note that the bitstream control data St63 is 
20 generated in pack units as shown in Fig. 1 9, and is sup- 
plied from the stream buffer 2400 to the decoding sys- 
tem controller 2300, to which the stream buffer 2400 is 
connected. 

[0223] The synchronizer 2900 is connected to the de- 
25 coding system controller 2300 from which it receives the 
system clock reference SCR contained in the synchro- 
nization control data St81 to set the internal system 
clock STC and supply the reset system clock St79 to the 
decoding system controller 2300. 
30 [0224] Based on this system clock St79, the decoding 
system controller 2300 also generates the stream read 
signal St65 at a specific interval and outputs the read 
signal St65 to the stream buffer 2400. Note that the read 
unit in this case is the pack. 
35 [0225] The method of generating the stream read sig- 
nal St65 is described next. 

[0226] The decoding system controller 2300 com- 
pares the system clock reference SCR contained in the 
stream control data extracted from the stream buffer 

40 2400 with the system clock St79 supplied from the syn- 
chronizer 2900, and generates- the read request signal 
St65 when the system clock St79 is greaterthan the sys- 
tem clock reference SCR of the bitstream control data 
St63. Pack transfers are controlled by executing this 

45 control process on a pack unit. 

[0227] Based on the scenario selection data St51 , the 
decoding system controller 2300 generates the decod- 
ing signal St69 defining the stream Ids forthe video, sub- 
picture, and audio bitstreams corresponding to the se- 

50 lected scenario, and outputs to the system decoder 
2500. 

[0228] When a title contains plural audio tracks, e.g. 
audio tracks in Japanese, English, French, and/or other 
languages, and plural sub-picture tracks for subtitles in 
55 Japanese, English, French, and/or other languages, for 
example, a discrete ID is assigned to each of the lan- 
guage tracks. As described above with reference to Fig. 
1 9, a stream ID is assigned to the video data and MPEG 
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audio daca, and a substream ID is assigned to the sub- 
picture data, AC-3 audio data, linear PCM data, and 
navigation pack NV information. While che user need 
never be aware of these ID numbers, the user can select 
the language of the audio and/or subtitles using the sce- 
nario selector 21 00. If English language audio is select- 
ed, for example, the ID corresponding . to the English 
audio track is sent to the decoding system controller 
2300 as scenario selection data St51 . The decoding 
system controller 2300 then adds this ID to the decoding 
signal St69 output to the system decoder 2500. 
[0229] Based on the instructions contained in the de- 
coding signal St69, the system decoder 2500 respec- 
tively outputs the video, sub-picture, and audio bit- 
streams input from the stream buffer 2400 to the video 
buffer 2600, sub-picture buffer 2700, and audio buffer 
2800 as the encoded video stream St71 , encoded sub- 
picture stream St73, and encoded audio stream St75. 
Thus, when the stream ID input from the scenario se- 
lector 21 00 and the pack ID input from the stream buffer 
2400 match, the system decoder 2500 outputs the cor- 
responding packs to the respective buffers (i.e., the vid- 
eo buffer 2600, sub-picture buffer 2700, and audio buffer 
2800). 

[0230] The system decoder 2500 detects the presen- 
tation time stamp PTS and decoding time stamp DTS of 
the smallest control unit in each bitstream St67 to gen- 
erate the time information signal St77. This time infor- 
mation signal St77 is supplied to the synchronizer 2900 
through the decoding system controller 2300 as the syn- 
chronization control data St81 . 
[0231] Based on this synchronization control data 
St81 , the synchronizer 2900 determines the decoding 
start timing whereby each of the bitstreams will be ar- 
ranged in the correct sequence after decoding, and then 
generates and inputs the video stream decoding start 
signal St89 to the video decoder 3801 based on this de- 
coding timing. The synchronizer 2900 also generates 
and supplies the sub-picture decoding start signal St91 
and audio stream decoding start signal St93 to the sub- 
picture decoder 31 00 and audio decoder 3200, respec- 
tively. 

[0232] The video decoder 3801 generates the video 
output request signal St84 based on the video stream 
decoding start signal St89, and outputs to the video buff- 
er 2600. In response to the video output request signal 
St84, the video buffer 2600 outputs the video stream 
St83 to the video decoder 3801 . The video decoder 
3801 thus detects the presentation time information 
contained in the video stream St83, and disables the vid- 
eo output request signal St84 when the length of the re- 
ceived video stream St83 is equivalent to the specified 
presentation time. A video stream equal in length to the 
specified presentation time is thus decoded by the video 
decoder 3801 , which outputs the reproduced video sig- 
nal St95 to the reordering buffer 3300 and selector 3400. 
[0233] Because the encoded video stream is coded 
using the interframe correlations between pictures, the 



coded order and display order do not necessarily match 
on a frame unit basis. The video cannot, therefore, be 
displayed in the decoded order. The decoded frames 
are therefore temporarily stored to the reordering buffer 

5 3300. The synchronizer 2900 therefore controls the 
switching signal St1 03 so that the reproduced video sig- 
nal St95 output from the video decoder 3800 and the 
reordering buffer output St97 are appropriately selected 
and output in the display order to the synthesizer 3500. 

10 [0234] The sub-picture decoder 31 00 similarly gener- 
ates the sub-picture output request signal St86 based 
on the sub-picture decoding start signal St91 , and out- 
puts to the sub-picture buffer 2700. In response to the 
sub-picture output request signal St86, the sub-picture 

15 buffer 2700 outputs the sub-picture stream St85 to the 
sub-picture decoder 3100. Based on the presentation 
time information contained in the sub-picture stream 
St85, the sub-picture decoder 31 00 decodes a length of 
the sub-picture stream St85 corresponding to the spec- 

20 ified presentation time to reproduce and supply to the 
synthesizer 3500 the sub-picture signal St99. 
[0235] The synthesizer3500 superimposes the selec- 
tor 3400 output with the sub-picture signal St99 to gen- 
erate and output the video signal St1 05 to the video data 

25 output terminal 3600. 

[0236] The audio decoder 3200 generates and sup- 
plies to the audio buffer 2800 the audio output request 
signal St88 based on the audio stream decoding start 
signal St93. The audio buffer 2800 thus outputs the au- 

30 dio scream St87 to the audio decoder 3200. The audio 
decoder 3200 decodes a length of the audio stream 
St87 corresponding to the specified presentation time 
based on the presentation time information contained in 
the audio stream St87, and outputs the decoded audio 

35 stream St1 01 to the audio data output terminal 3700. 
[0237] It is thus possible to reproduce a user-defined 
multimedia bitstream MBS in real-time according to a 
user-defined scenario. More specifically, each time the 
user selects a different scenario, the DVD decoder DCD 

40 is able to reproduce the title content desired by the user 
in the desired sequence by reproducing the multimedia 
bitstream MBS corresponding to the selected scenario. 
[0238] It should be noted that the decoding system 
controller 2300 may supply the title information signal 

45 St200 to the scenario selector 21 00 by means of the in- 
frared communications device mentioned above or an- 
other means. Interactive scenario selection controlled 
by the user can also be made possible by the scenario 
selector2100 extracting the title information recordedto 

so the optical disk M from the file data structure area FDS 
of the bitstream control data St63 contained in the title 
information signal St200, and displaying this title infor- 
mation on a display for user selection. 
[0239] Note, further, that the stream buffer 2400, vid- 

55 eo buffer 2600, sub-picture buffer 2700, audio buffer 
2800, and reordering buffer 3300 are expressed above 
and in the figures as separate entities because they are 
functionally different. It will be obvious, however, that a 
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single buffer memory can be controlled to provide the 
same discrete functionality by time-share controlled use 
of a buffer memory with an operating speed plural times 
faster than the read and write rates of these separate 
buffers. 

Multi-scene control 

[0240] The concept of multiple angle scene control 
according to the present invention is described below 
with reference to Fig. 21 . As described above, titles that 
can be played back with numerous variations are con- 
structed from basic scene periods containing data com- 
mon to each title, and multi-scene periods comprising 
groups of different scenes corresponding to the various 
scenario requests. In Fig. 21 , scenes 1 , 5, and 8 are the 
common scenes of the basic scene periods. The multi- 
angle scenes (angles 1 , 2, and 3) between scenes 1 and 
5, and the parental locked scenes (scenes 6 and 7) be- 
tween scenes 5 and 8, are the multi-scene periods. 
[0241] Scenes taken from different angles, i.e., an- 
gles 1 , 2, and 3 in this example, can be dynamically se- 
lected and reproduced during playback in the multi-an- 
gle scene period. In the parental locked scene period, 
however, only one of the available scenes, scenes 6 and 
7, having different content can be selected, and must be 
selected statically before playback begins. 
[0242] Which of these scenes from the multi-scene 
periods is to be selected and reproduced is defined by 
the user operating the scenario selector21 00 and there- 
by generating the scenario selection data St51 . In sce- 
nario 1 in Fig. 21 the user can freely select any of the 
multi-angle scenes, and scene 6 has been preselected 
for output in the parental locked scene period. Similarly 
in scenario 2, the user can freely select any of the multi- 
angle scenes, and scene 7 has been preselected for 
output in the parental locked scene period. 
[0243] With reference to Figs. 30 and 31 furthermore, 
the contents of the "program chain information 
VTS_PGCI is described. In Fig. 30, the case that a sce- 
nario requested by the user is shown with respect to a 
VTSI data construction. The scenario 1 and scenario 2 
shown in Fig. 21 are described as program chain infor- 
mation VTS_PGC#1 and VTS_PGC#2. VTS_PGC#1 
describing the scenario 1 consists of cell playback infor- 
mation C_PBI#1 corresponding to scene 1 , C_PBI#2, 
C_PBI#3, and C_PBI#4 within a multi-angle cell block, 
C_PBI#5 corresponding to scene 5, C_PBI#6 corre- 
sponding to scene 6, and C_PBI#7 corresponding to 
scene 8. 

[0244] VTS_PGCI#2 describing the scenario 2 con- 
sists of cell playback information C_PB!#1 correspond- 
ing to scene 1 , C_PBI#2, C_PBI#3, and C_PBI#4 within 
a multi-angle cell block corresponding to a multi-angle 
scene, C_PBI#5 corresponding to scene 5, C_PBI#6 
corresponding to scene 7, and C_PBI#7 corresponding 
to scene 8. According to the digital video system data 
structure, a scene which is a control unit of a scenario 



is described as a cell which is a unit thereunder, thus a 
scenario requested by a user can be obtained. 
[0245] In Fig. 31 , the case that a scenario requested 
by the user shown in Fig. 21 is shown with respect to a 

5 VOB data construction VTSTT_VOBS. As specifically 
shown in Fig. 31, the two scenarios 1 and 2 use the 
same VOB data in common. With respect to a single 
scene commonly owned by each scenario, VOB#1 cor- 
responding to scene 1 , VOB#5 corresponding to scene 

10 5, and VOB#8 corresponding to scene 8 are arranged 
in non-interleaved block which is the contiguous block. 
[0246] With respect to the multi-angle data commonly 
owned by scenarios 1 and 2, one angle scene data is 
constructed by a single VOB. Specifically speaking, an- 

15 gie 1 is constructed by VOB#2, and angle 2 is construct- 
ed by VOB#3, angle 3 is constructed by VOB#4. Thus 
constructed multi-angle data is formed as the inter- 
leaved block for the sake of switching between each an- 
gle and seamless reproduction of each angle data. 

20 Scenes 6 and 7 peculiar to scenarios 1 and 2, respec- 
tively, are formed as the interleaved block for the sake 
of seamless reproduction between common scenes be- 
fore and behind thereof as well as seamless reproduc- 
tion between each scene. 

25 [0247] As described in the above, the user's request- 
ing scenario shown in Fig. 21 can be realized by utilizing 
the video title playback control information shown in Fig. 
30 and the title playback VOB data structure shown in 
Fig. 31. 

30 

Seamless playback 

[0248] The seamless playback capability briefly men- 
tioned above with regard to the digital video disk system 

35 data structure is described below. Note that seamless 
playback refers to the reproduction in a digital video disk 
system of multimedia data including video, audio, and 
sub-picture data without intermittent breaks in the data 
or information between basic scene periods, between 

40 basic scene periods and multi-scene periods, and be- 
tween multi-scene periods. 

[0249] Hardware factors contributing to intermittent 
playback of this data and title content include decoder 
underflow, i.e., an imbalance between the source data 
45 input speed and the decoding speed of the input source 
data. 

[0250] Other factors relate to the properties of the 
playback data. When the playback data is data that must 
be continuously reproduced for a constant time unit in 

50 order for the user to understand the content or informa- 
tion, e.g., audio data, data continuity is lost when the 
required continuous presentation time cannot be as- 
sured. Reproduction of such information whereby the 
required continuity is assured is referred to as "contig- 

55 uous information reproduction," or "seamless informa- 
tion reproduction." Reproduction of this information 
when the required continuity cannot be assured is re- 
ferred to as "non-continuous information reproduction," 
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or "non-seamless information reproduction." It is obvi- 
ous that continuous information reproduction and non- 
continuous information reproduction are, respectively, 
seamless and non-seamless reproduction. 
[0251] Note that seamless reproduction can be fur- 
ther categorized as seamless data reproduction and 
seamless information reproduction. Seamless data re- 
production is defined as preventing physical blanks or 
interruptions in the data playback (intermittent reproduc- 
tion) as a result of a buffer underflow state, for example. 
Seamless information reproduction is defined as pre- 
venting apparent interruptions in the information when 
perceived by the user (intermittent presentation) when 
recognizing information from the playback data where 
there are no actual physical breaks in the data repro- 
duction. 

Details of Seamless playback 

[0252] The specific method enabling seamless repro- 
duction as thus described is described later below with 
reference to Figs. 23 and 24. 

Interleaving 

[0253] The DVD data system streams described 
above are recorded using an appropriate authoring en- 
coder EC as a movie or other multimedia title on a DVD 
recording medium. Note that the following description 
refers to a movie as the multimedia title being proc- 
essed, but it will be obvious that the invention shall not 
be so limited. 

[0254] Supplying a single movie in a format enabling 
the movie to be used in plural different cultural regions 
or countries requires the script to be recorded in the var- 
ious languages used in those regions or countries. It 
may even necessitate editing the content to conform to 
the mores and moral expectations of different cultures. 
Even using such a large-capacity storage system as the 
DVD system, however, it is necessary to reduce the bit 
rate, and therefore the image quality, if plural full-length 
titles edited from a single common source title are re- 
corded to a single disk. This problem can be solved by 
recording the common parts of plural titles only once, 
and recording the segments different in each title for 
each different title only. This method makes it possible 
to record plural titles for different countries or cultures 
to a single optical disk without reducing the bit rate, and, 
therefore, retaining high image quality. 
[0255] As shown in Fig. 21 , the titles recorded to a 
single optical disk contain basic scene periods of scenes 
common to all scenarios, and multi-scene periods con- 
taining scenes specific to certain scenarios, to provide 
parental lock control and multi-angle scene control func- 
tions. 

[0256] I n the case of the parental lock control function , 
titles containing sex scenes, violent scenes, or other 
scenes deemed unsuitable for children, i.e., so-called 



"adult scenes," are recorded with a combination of com- 
mon scenes, adult scenes, and children's scenes. 
These title streams are achieved by arraying the adult 
and children's scenes to multi-scene periods between 

5 the common basic scene periods. 

[0257] Multi-angle control can be achieved in a con- 
ventional single-angle title by recording plural multime- 
dia scenes obtained by recording the subjects from the 
desired plural camera angles to the multi-scene periods 

10 arrayed between the common basic scene periods. 
Note, however, that while these plural scenes are de- 
scribed here as scenes recorded from different camera 
angles (positions), it will be obvious that the scenes may 
be recorded from the same camera angle but at different 

15 times, data generated by computer graphics, or other 
video data. 

[0258] When data is shared between different scenar- 
ios of a single title, it is obviously necessary to move the 
laser beam LS from the common scene data to the non- 
20 common scene data during reproduction, i.e., to move 
the optical pickup to a different position on the DVD re- 
cording medium RC1 . The problem here is that the time 
required to move the optical pickup makes it difficult to 
continue reproduction without creating breaks in the au- 
25 dio or video, i.e., to sustain seamless reproduction. This 
problem can be theoretically solved by providing a track 
buffer (stream buffer 2400) to delay data output an 
amount equivalent to the worst access time. In general, 
data recorded to an optical disk is read by the optical 
30 pickup, appropriately processed, and temporarily stored 
to the track buffer. The stored data is subsequently de- 
coded and reproduced as video or audio data. 

Definition of Interleaving 

35 

[0259] To thus enable the user to selectively excise 
scenes and choose from among plural scenes, a state 
wherein non-selected scene data is recorded inserted 
between common scene data and selective scene data 

40 necessarily occurs because the data units associated 
with individual scenes are contiguously recorded to the 
recording tracks of the recording medium. If data is then 
■ read in the recorded sequence, non-selected scene da- 
ta must be accessed before accessing and decoding the 

45 selected scene data, and seamless connections with 
the selected scene is difficult. The excellent random ac- 
cess characteristics of the digital video disk system, 
however, make seamless connections with the selected 
scenes possible. 

50 [0260] In other words, by splitting scene-specific data 
into plural units of a specified data size, and interleaving 
plural split data units for different scenes in a predefined 
sequence that is recorded to disk within the jumping 
range whereby an data underflow state does not occur, 

55 it is possible to reproduce the selected scenes without 
data interruptions by intermittently accessing and de- 
coding the data specific to the selected scenes using 
these split data units. Seamless data reproduction is 
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thereby assured. 

Interleaved Block and Interleave Unit 

[0261] The interleaving method enabling seamless 
data reproduction according to the present invention is 
described below with reference to Fig. 24 and Fig. 71. 
Shown in Fig. 24 is a case from which three scenarios 
may be derived, i.e., branching from one video object 
VOB-A to one of plural video objects VOB-B, VOB-C, 
and VOB-D, and then merging back again to a single 
video object VOB-E. The actual arrangement of these 
blocks recorded to a data recording track TR on disk is 
shown in Fig. 71. 

[0262] Referring to Fig. 71 , VOB-A and VOB-E are 
video objects with independent playback start and end 
times, and are in principle arrayed to contiguous block 
regions. As shown in Fig. 24, the playback start and end 
times of VOB-B, VOB-C, and VOB-D are aligned during 
interleaving. The interleaved data blocks are then re- 
corded to disk to a contiguous interleaved block region. 
The contiguous block regions and interleaved block re- 
gions are then written to disk in the track path Dr direc- 
tion in the playback sequence. Plural video objects 
VOB, i.e., interleaved video objects VOBS, arrayed to 
the data recording track TR are shown in Fig. 37. 
[0263] Referring to Fig. 37, data regions to which data 
is continuously arrayed are called "blocks," of which 
there are two types: "contiguous block regions" in which 
VOB with discrete starting and end points are contigu- 
ously arrayed, and "interleaved block regions" in which 
plural VOB with aligned starting and end points are in- 
terleaved. The respective blocks are arrayed as shown 
in Fig. 38 in the playback sequence, i.e., block 1 , block 
2, block 3, . . . block 7. 

[0264] As shown in Fig. 73, the VTS title VOBS 
(VTSTT^VOBS) consist of blocks 1 - 7, inclusive. Block 
1 contains VOB 1 alone. Blocks 2, 3, 5, and 7 similarly 
discretely contain VOBS 2, 3, 6, and 1 0. Blocks 2, 3, 5, 
and 7 are thus contiguous block regions. 
[0265] Block 4, however, contains VOB 4 and VOB 5 
interleaved together, while block 6 contains VOB 7, VOB 
8, and VOB 9 interleaved together. Blocks 4 and 6 are 
thus interleaved block regions. 

[0266] The internal data structure of the contiguous 
block regions is shown in Fig. 73 with VOB-i and VOB- 
j arrayed as the contiguous blocks in the VOBs. As de- 
scribed with reference to Fig. 16, VOB-i and VOB-j in- 
side the contiguous block regions are further logically 
divided into cells as the playback unit. Both VOB-i and 
VOB-j in this figure are shown comprising three cells 
CELL #1 , CELL #2, and CELL #3. 
[0267] Each cell comprises one or more video object 
unit VOBU with the video object unit VOBU defining the 
boundaries of the cell. Each cell also contains informa- 
tion identifying the position of the cell in the program 
chain PGC (the playback control information of the dig- 
ital video disk system) . More specifically, this position 



information is the address of the first and last VOBU in 
the cell. As also shown in Fig. 73, these VOB and the 
cells defined therein are also recorded to a contiguous 
block region so that contiguous blocks are contiguously 
5 reproduced. Reproducing these contiguous blocks is 
therefore no problem. 

[0268] The internal data structure of the interleaved 
block regions is shown in Fig. 74. In the interleaved 
block regions each video object VOB is divided into in- 
fo terleaved units ILVU, and the interleaved units ILVU as- 
sociated with each VOB are alternately arrayed. Cell 
boundaries are defined independently of the interleaved 
units ILVU. For example, VOB-k is divided into four in- 
terleaved units ILVUkl, ILVUk2, ILVUk3, and ILVUk4 t 
15 and are confined by a single cell CELL#k. VOB-k is like- 
wise divided into four interleaved units ILVUml, 
ILVUm2, ILVUm3, and !LVUm4, and is confined by a sin- 
cle cell CELL#m. Note that instead of a single cell 
CELL#k or CELL#m, each of VOB-k and VOB-m can be 
20 divided into more than two cells. The interleaved units 
ILVU thus contains both audio and video data. 
[0269] In the example shown in Fig. 74, the inter- 
leaved units ILVUkl , ILVUk2, ILVUk3, and ILVUk4, and 
ILVUml, ILVUm2, lLVUm3, and ILVUm4, from two dif- 
25 ferent video objects VOB-k and VOB-m are alternately 
arrayed within a single interleaved block. By interleaving 
the interleaved units ILVU of two video objects VOB in 
this sequence, it is possible to achieve seamless repro- 
duction branching from one scene to one of plural 
30 scenes, and from one of plural scenes to one scene. 

Multi-scene control 

[0270] The multi-scene period is described together 
35 with the concept of multi-scene control according to the 
present invention using by way of example a title com- 
prising scenes recorded from different angles. 
[0271] Each scene in multi-scene control is recorded 
from the same angle, but may be recorded at different 
40 times or may even be computer graphics data. The mul- 
ti-angle scene periods may therefore also be called mul- 
ti-scene periods. 

Parental control 

45 

[0272] The concept of recording plural titles compris- 
ing alternative scenes for such functions as parental 
lock control and recording director's cuts is described 
below using Fig. 15. 

so [0273] An example of a multi-rated title stream provid- 
ing for parental lock control is shown in Fig. 15. When 
so-called "adult scenes" containing sex, violence, or 
other scenes deemed unsuitable for children are con- 
tained in a title implementing parental lock control, the 

55 title stream is recorded with a combination of common 
system streams SSa, SSb, and Sse, an adult-oriented 
system stream SSc containing the adult scenes, and a 
child-oriented system stream SSd containing only the 
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scenes suitable for children. Title streams such as this 
are recorded as a multi-scene system stream containing 
the adult-oriented system stream Ssc and the child-ori- 
ented system stream Ssd arrayed to the multi-scene pe- 
riod between common system streams Ssb and Sse. 
[0274] The relationship between each of the compo- 
nent titles and the system stream recorded to the pro- 
gram chain PGC of a title stream thus comprised is de- 
scribed below. 

[0275] The adult-oriented title program chain PGC1 
comprises in sequence the common system streams 
Ssa and Ssb, the adult-oriented system stream Ssc, and 
the common system stream Sse. The child-oriented title 
program chain PGC2 comprises in sequence the com- 
mon system streams Ssa and Ssb, the child-oriented 
system stream Ssd, and che common system stream 
Sse. 

[0276] By thus arraying the adult-oriented system 
stream Ssc and child-oriented system stream Ssd to a 
multi-scene period, the decoding method previously de- 
scribed can reproduce the title containing adult-oriented 
content by reproducing the common system streams 
Ssa and Ssb, then selecting and reproducing the adult- 
oriented system stream Ssc, and then reproducing the 
common system stream Sse as instructed by the adult- 
oriented title program chain PGC1 . By alternatively fol- 
lowing the child-oriented title program chain PGC2 and 
selecting the child-oriented system stream Ssd in the 
multi-scene period, a child-oriented title from which the 
adult-oriented scenes have been expurgated can be re- 
produced. 

[0277] This method of providing in the title stream a 
multi-scene period containing plural alternative scenes, 
selecting which of the scenes in the multi-scene period 
are to be reproduced before playback begins, and gen- 
erating plural titles containing essentially the same title 
content but different scenes in part, is called parental 
lock control. 

[0278] Note that parental lock control is so named be- 
cause of the perceived need to protect children from un- 
desirable content. From the perspective of system 
stream processing, however, parental lock control is a 
technology for statically generating different title 
streams by means of the user preselecting specific 
scenes from a multi-scene period. Note, further, that this 
contrasts with multi-angle scene control, which is a tech- 
nology for dynamically changing the content of a single 
title by means of the user selecting scenes from the mul- 
ti-scene period freely and in real-time during title play- 
back. 

[0279] This parental lock control technology can also 
be used to enable title stream editing such as when mak- 
ing the director's cut. The director's cut refers to the 
process of editing certain scenes from a movie to, for 
example, shorten the total presentation time. This may 
be necessary, for example, to edit a feature-length mov- 
ie for viewing on an airplane where the presentation time 
is too long for viewing within the flight time or certain 



content may not be acceptable. The movie director thus 
determines which scenes may be cut to shorten the 
movie. The title can then be recorded with both a full- 
length, unedited system stream and an edited system 

5 stream in which the edited scenes are recorded to multi- 
scene periods. At the transition from one system stream 
to another system stream in such applications, parental 
lock control must be able to maintain smooth playback 
image output. More specifically, seamless data repro- 

10 duction whereby a data underflow state does not occur 
in the audio, video, or other buffers, and seamless in- 
formation reproduction whereby no unnatural interrup- 
tions are audibly or visibly perceived in the audio and 
video playback, are necessary. 

15 

Multi-angle control 

[0280] The concept of multi-angle scene control in the 
present invention is described next with reference to Fig. 

20 33. In general, multimedia titles are obtained by record- 
ing both the audio and video information (collectively 
"recording" below) of the subject over time T The angled 
scene blocks #SC1, #SM1, #SM2, #SM3, and #SC3 
represent the multimedia scenes obtained at recording 

25 unit times T1 , T2, and T3 by recording the subject at 
respective camera angles. Scenes #SM1, #SM2, and 
#SM3 are recorded at mutually different (first, second, 
and third) camera angles during recording unit time T2, 
and are referenced below as the first, second, and third 

30 angled scenes. 

[0281] Note that the multi-scene periods referenced 
herein are basically assumed to comprise scenes re- 
corded from different angles. The scenes may, however, 
be recorded from the same angle but at different times, 

35 or they may be computer graphics data. The multi-angle 
scene periods are thus the multi-scene periods from 
which plural scenes can be selected for presentation in 
the same time period, whether or not the scenes are ac- 
tually recorded at different camera angles. 

40 [0282] Scenes #SC1 and #SC3 are scenes recorded 
at the same common camera angle during recording 
unit times T1 and T3, i.e., before and after the multi- 
angle scenes. These scenes are therefore called "ccm- 
mon angle scenes." Note that one of the multiple camera 

45 angles used in the multi-angle scenes is usually the 
same as the common camera angle. 
[0283] To understand the relationship between these 
various angled scenes, multi-angle scene control is de- 
scribed below using a live broadcast of a baseball game 

so for example only. 

[0284] The common angle scenes #SC1 and #SC3 
are recorded at the common camera angle, which is 
here defined as the view from center field on the axis 
through the pitcher, batter, and catcher. 

55 [0285] The first angled scene #SM 1 is recorded at the 
first multi-camera angle, i.e., the camera angle from the 
backstop on the axis through the catcher, pitcher, and 
batter. The second angled scene #SM2 is recorded at 
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the second multi-camera angle, i.e., the view from cent- 
erf ield on the axis through the pitcher, batter, and catch- 
er. Note that the second angled scene #SM2 is thus the 
same as the common camera angle in this example. It 
therefore follows that the second angled scene #SM2 is 
the same as the common angle scene #SC2 recorded 
during recording unit time T2. The third angled scene 
#SM3 is recorded at the third multi-camera angle, i.e., 
the camera angle from the backstop focusing on the in- 
field. 

[0286] The presentation times of the multiple angle 
scenes #SM1, #SM2, and #SM3 overlap in recording 
unit time T2; this period is called the "multi-angle scene 
period." By freely selecting one of the multiple angle 
scenes #SM1, #SM2, and #SM3 in this multi-angle 
scene period, the viewer is able to change his or her 
virtual viewing position to enjoy a different view of the 
game as though the actual camera angle is changed. 
Note that while there appears to be a time gap between 
common angle scenes #SC1 and #SC3 and the multiple 
angle scenes #SM1 , #SM2, and #SM3 in Fig. 33, this is 
simply to facilitate the use of arrows in the figure for eas- 
ier description of the data reproduction paths repro- 
duced by selecting different angled scenes. There is no 
actual time gap during playback. 
[0287] Multi-angle scene control of the system stream 
based on the present invention is described next with 
reference to Fig. 23 from the perspective of connecting 
data blocks. The multimedia data corresponding to com- 
mon angle scene #SC is referenced as common angle 
data BA, and the common angle data BA in recording 
unit times T1 and T3 are referenced as BA1 and BA3, 
respectively. The multimedia data corresponding to the 
multiple angle scenes #SM1 , #SM2, and #SM3 are ref- 
erenced as first, second, and third angle scene data 
MA1 , MA2, and MA3. As previously described with ref- 
erence to Fig. 33, scenes from the desired angled can 
be viewed by selecting one of the multiple angle data 
units MA1, MA2, and MA3. There is also no time gap 
between the common angle data 3A1 and BA3 and the 
multiple angle data units MA1 , MA2, and MA3. 
[0288] In the case of an MPEG system stream, how- 
ever, intermittent breaks in the playback information can 
result between the reproduced common and multiple 
angle data units depending upon the content of the data 
at the connection between the selected multiple angle 
data unit MA1, MA2, and MA3 and the common angle 
data BA (either the first common angle data BA1 before 
the angle selected in the multi-angle scene period or the 
common angle data BA3 following the angle selected in 
the multi-angle scene period). The result in this case is 
that the title stream is not naturally reproduced as a sin- 
gle contiguous title, i.e., seamless data reproduction is 
achieved but non-seamless information reproduction 
results. 

[0289] The multi-angle selection process whereby 
one of plural scenes is selectively reproduced from the 
multi-angle scene period with seamless information 



presentation to the scenes before and after is described 
below with application in a digital video disk system us- 
ing Fig. 23. 

[0290] Changing the scene angle, i.e., selecting one 

5 of the multiple angle data units MA1, MA2, and MA3, 
must be completed before reproduction of the preceding 
common angle data BA1 is completed. It is extremely 
difficult, for example, to change to a different angle data 
unit MA2 during reproduction of common angle data 

10 BA1 . This is because the multimedia data has a variable 
length coded MPEG data structure, which makes it dif- 
ficult to find the data break points (boundaries) in the 
selected data blocks. The video may also be disrupted 
when the angle is changed because inter-frame corre- 

15 lations are used in the coding process. The 
group_of_pictures GOP processing unit of the MPEG 
standard contains at least one refresh frame, and closed 
processing not referencing frames belonging to another 
GOP is possible within this GOP processing unit. 

20 [0291] In other words, if the desired angle data, e. g., 
MA3, is selected before reproduction reaches the multi- 
angle scene period, and at the latest by the time repro- 
duction of the preceding common angle data BA1 is 
completed, the angle data selected from within the multi- 

25 angle scene period can be seamlessly reproduced. 
However, it is extremely difficult while reproducing one 
angle to select and seamlessly reproduce another angle 
within the same multi-angle scene period. It is therefore 
difficult when in a multi-angle scene period to dynami- 

30 cally select a different angle unit presenting, for exam- 
ple, a view from a different camera angle. 

Flow chart: encoder 

35 [0292] The encoding information table generated by 
the encoding system controller 200 from information ex- 
tracted from the scenario data St7 is described below 
referring to Fig. 27. 

[0293] The encoding information table contains VOB 
40 set data streams containing plural VOB corresponding 
to the scene periods beginning and ending at the scene 
branching and connecting points, and VOB data 
streams corresponding to each scene. These VOB set 
data streams shown in Fig. 27 are the encoding infor- 
ms mation tables generated at step #100 in Fig. 34 by the 
encoding system controller 200 for creating the DVD 
multimedia stream based on the user-defined title con- 
tent. 

[0294] The user-defined scenario contains branching 
50 points from common scenes to plural scenes, or con- 
nection points to other common scenes. The VOB cor- 
responding to the scene period delimited by these 
branching and connecting points is a VOB set, and the 
data generated to encode a VOB set is the VOB set data 
55 stream. The title number specified by the VOB set data 
stream is the title number TITLEJMO of the VOB set da- 
ta stream. 

[0295] The VOB Set data structure in Fig. 27 shows 
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the data content for encoding one VOB set in the VOB 
set data stream, and comprises: the VOB set number 
VOBSJvJO, the VOB number VOB_NO in the VOB set, 
the preceding VOB seamless connection flag 
VOB_Fsb, the following VOB seamless connection flag 
VOB_Fsf, the multi-scene flag VOB_Fp, the interleave 
flag VOB_Fi, the multi-angle flag VOB_Jm, the multi- 
angle seamless switching flag VOB_FsV, the maximum 
bit rate of the interleaved VOB ILV_BR, the number of 
interleaved VOB divisions ILV_DIV, and the minimum in- 
terleaved unit presentation time ILVU_MT. 
[0296] The VOB set number VOBS_NO is a sequen- 
tial number identifying the VCB set and the position of 
the VOB set in the reproduction sequence of the title 
scenario. 

[0297] The VOB number VOB_NO is a sequential 
number identifying the VOB and the position of the VOB 
in the reproduction sequence of the title scenario. 
[0298] The preceding VOB seamless connection flag 
VOB_Fsb indicates whether a seamless connection 
with the preceding VOB is required for scenario repro- 
duction. 

[0299] The following VOB seamless connection flag 
VOB_Fsf indicates whether there is a seamless connec- 
tion with the following VOB during scenario reproduc- 
tion. 

[0300] The multi-scene flag VOB_Fp identifies wheth- 
er the VOB set comprises plural video objects VOB. 
[0301] The interleave flag VOB_Fi identifies whether 
the VOB in the VOB set are interleaved. 
[0302] Themulti-angleflag VOB_Fm identifies wheth- 
er the VOB set is a multi-angle set. 
[0303] The multi-angle seamless switching flag 
VOB_FsV identifies whether angle changes within the 
multi-angle scene period are seamless or not. 
[0304] The maximum bit rate of the interleaved VOB 
ILV_BR defines the maximum bit rate of the interleaved 
VOBs. 

[0305] The number of interleaved VOB divisions 
ILV_DIV identifies the number of interleave units in the 
interleaved VOB. 

[0306] The minimum interleave unit presentation time 
ILVU JvIT defines the time that can be reproduced when 
the bit rate of the smallest interleave unit at which a track 
buffer data underflow state does not occur is the maxi- 
mum bit rate of the interleaved VCB ILV_BR during in- 
terleaved block reproduction. 

[0307] The encoding information table for each VOB 
generated by the encoding system controller 200 based 
on the scenario data St7 is described below referring to 
Fig. 28. The VOB encoding parameters described below 
and supplied to the video encoder 300, audio encoder 
700, and system encoder 900 for stream encoding are 
produced based on this encoding information table. 
[0308] The VOB data streams shown in Fig. 28 are 
the encoding information tables generated at step #1 00 
in Fig. 34 by the encoding system controller 200 for cre- 
ating the DVD multimedia stream based on the user- 



defined title content. 

[0309] The encoding unit is the video object VOB, and 
the data generated to encode each video object VOB is 
the VOB data stream. For example, a VOB set compris- 
5 ing three angle scenes comprises three video objects 
VOB. The data structure shown in Fig. 28 shows the 
content of the data for encoding one VCB in the VOB 
data stream. 

[031 0] The VOB data structure contains the video ma- 
10 terial start time VOB_VST, the video material end time 
VOB_VEND, the video signal type VOB_V_KIND, the 
video encoding bit rate V_BR, the audio material start 
time VOB_AST,the audio material end time 
VOB_AEND, the audio coding method VOB_A_KIND, 
15 and the audio encoding bit rate A_BR. 

[031 1 ] The video material start time VOB_VST is the 
video encoding start time corresponding to the time of 
the video signal. 

[031 2] The video material end time VOB_VEND is the 
20 video encoding end time corresponding to the time of 
the video signal. 

[0313] The video material type VOB_V_KIND identi- 
fies whether the encoded material is in the NTSC or PAL 
format, for example, or is photographic materia! (a mov- 

25 ie, for example) converted to a television broadcast for- 
mat (so-called telecine conversion). 
[031 4] The video encoding bit rate V_BR is the bit rate 
at which the video signal is encoded. 
[031 5] The audio material start time VOB_AST is the 

30 audio encoding start time corresponding to the time of 
the audio signal. 

[031 6] The audio material end time VOB_AEND is the 
audio encoding end time corresponding to the time of 
the audio signal. 
35 [0317] The audio coding method VOB_A_KIND iden- 
tifies the audio encoding method as AC-3, MPEG, or lin- 
ear PCM, for example. 

[0318] The audio encoding bit rate AJ3R is the bit rate 
at which the audio signal is encoded. 

40 [0319] The encoding parameters used by the video 
encoder 300, sub-picture encoder 500, and audio en- 
coder 700, and system encoder 900 for VOB encoding 
are shown in Fig. 29. The encoding parameters include: 
the VOB number VOB_NO, video encode start time 

45 V_STTM, video encode end time V_ENDTM, the video 
encode mode V_ENCMD, the video encode bit rate 
V_RATE, the maximum video encode bit rate 
VJvlRATE, the GOP structure fixing flag GOP_Fxflag, 
the video encode GOP structure GOPST, the initial vid- 

50 eo encode data VJNTST, the last video encode data 
V.ENDST, the audio encode start time A_STTM, the au- 
dio encode end time A_ENDTM, the audio encode bit 
rate A_RATE, the audio encode method AJENCMD, the 
audio start gap A_STGAP, the audio end gap 

55 A_ENDGAP, the preceding VOB number B_VOB_NO, 
and the following VOB number F_VOB_NO. 
[0320] The VOB number VOBJMO is a sequential 
number identifying the VOB and the position of the VOB 
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in the reproduction sequence of the title scenario. 
[0321] The video encode start time V_STTM is the 
start time of video material encoding. 
[0322] The video encode end time V_ENDTM is the 
end time of video material encoding. 5 
[0323] The video encode mode V_ENCMD is an en- 
coding mode for declaring whether reverse telecin e con- 
version shall be accomplished during video encoding to 
enable efficient coding when the video material is tele- 
cine converted material. 

[0324] The video encode bit rate V_RATE is the av- 
erage bit rate of video encoding. 
[0325] The maximum video encode bit rate V_M RATE 
is the maximum bit rate of video encoding. 
[0326] The GOP structure fixing flag GOP_Fxflag 
specifies whether encoding is accomplished without 
changing the GOP structure in the middle of the video 
encoding process. This is a useful parameterfor declar- 
ing whether seamless switch is enabled in a multi-angle 
scene period. 

[0327] The video encode GOP structure GOPST is 
the GOP structure data from encoding. 
[0328] The initial video encode data VJNTST sets the 
initial value of the VBV buffer (decoder buffer) at the start 
of video encoding, and is referenced during video de- 
coding to initialize the decoding buffer. This is a useful 
parameterfor declaring seamless reproduction with the 
preceding encoded video stream. 
[0329] The last video encode data V_ENDST sets the 
end value of the VBV buffer (decoder buffer) at the end 
of video encoding, and is referenced during video de- 
coding to initialize the decoding buffer. This is a useful 
parameter for declaring seamless reproduction with the 
preceding encoded video stream. 
[0330] The audio encode start time A_STTM is the 
start time of audio material encoding. 
[0331] The audio encode end time A_ENDTM is the 
end time of audio material encoding. 
[0332] The audio encode bit rate AERATE is the bit 
rate used for audio encoding. 

[0333] The audio encode method A_ENCMD identi- 
fies the audio encoding method as AC-3, MPEG, or lin- 
ear PCM, for example. 

[0334] The audio start gap A_STGAP is the time off- 
set between the start of the audio and video presenta- 
tion at the beginning of a VOB. This is a useful param- 
eter for declaring seamless reproduction with the pre- 
ceding encoded system stream. 
[0335] The audio end gap A_ENDGAP is the time off- 
set between the end of the audio and video presentation 
at the end of a VOB. This is a useful parameter for de- 
claring seamless reproduction with the preceding en- 
coded system stream. 

[0336] The preceding VOB number B_VOB_NO is the 
VOB_NO of the preceding VOB when there is a seam- 
lessly connected preceding VOB. 
[0337] The following VOB number F_VOB_NO is the 
VOB_NO of the following VOB when there is a seam- 



lessly connected following VOB. 
[0338] The operation of a DVD encoder ECD accord- 
ing to the present invention is described below with ref- 
erence to the flow chart in Fig. 34. Note that the steps 
shown with a double line are subroutines. It should be 
obvious that while the operation described below relates 
specifically in this case to the DVD encoder ECD of the 
present invention, the operation described also applies 
to an authoring encoder EC. 

[0339] At step #1 00, the user inputs the editing com- 
mands according to the user-defined scenario while 
confirming the content of the multimedia source data 
streams St1 , St2, and St3. 

[0340] At step #200, the scenario editor 100 gener- 
ates the scenario data St7 containing the above edit 
command information according to the user's editing in- 
structions. 

[0341] When generating the scenario data St7 in step 
#200, the user editing commands related to multi-angle 
and parental lock multi-scene periods in which interleav- 
ing is presumed must be input to satisfy the following 
conditions. 

[0342] First, the VOB maximum bit rate must be set 
to assure sufficient image quality, and the track buffer 
capacity, jump performance, jump time, and jump dis- 
tance of the DVD decoder DCD used as the reproduc- 
tion apparatus of the DVD encoded data must be deter- 
mined. Based on these values, the reproduction time of 
the shortest interleaved unit is obtained from equations 
3 and 4. Based on the reproduction time of each scene 
in the multi-scene period, it must then be determined 
whether equations 5 and 6 are satisfied. If equations 5 
and 6 are not satisfied, the user must change the edit 
commands until equations 5 and 6 are satisfied by, for 
example, connecting part of the following scene to each 
scene in the multi-scene period. 
[0343] When multi-angle edit commands are used, 
equation 7 must be satisfied for seamless switching, and 
edit commands matching the audio reproduction time 
with the reproduction time of each scene in each angle 
must be entered. If non-seamless switching is used, the 
user must enter commands to satisfy equation 8. 
[0344] At step #300, the encoding system controller 
200 first determines whether the target scene is to be 
seamlessly connected to the preceding scene based on 
the scenario data St7. 

[0345] Note that when the preceding scene period is 
a multi-scene period comprising plural scenes but the 
presently selected target scene is a common scene (not 
in a multi-scene period), a seamless connection refers 
to seamlessly connecting the target scene with any one 
of the scenes contained in the preceding multi-scene 
period. When the target scene is a multi-scene period, 
a seamless connection still refers to seamlessly con- 
necting the target scene with any one of the scenes from 
the same multi-scene period. 

[0346] If step #300 returns NO, i.e., a non-seamless 
connection is valid, the procedure moves to step #400. 
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[0347] At step #400, the encoding system controller 
200 resets the preceding VOB seamless connection flag 
VOB_Fsb indicating whether there is a seamless con- 
nection between the target and preceding scenes. The 
procedure then moves to step #600. 
[0348] On the other hand, if step #300 returns YES, i. 
e., there is a seamless connection to the preceding 
scene, the procedure moves to step #500. 
[0349] At step #500 the encoding system controller 
200 sets the preceding VOB seamless connection flag 
VOB_Fsb. The procedure then moves to step #600. 
[0350] At step #600 the encoding system controller 
200 determines whether there is a seamless connection 
between the target and following scenes based on sce- 
nario data St7. If step #600 returns NO, i.e., a non-seam- 
less connection is valid, the procedure moves to step 
#700. 

[0351] At step #700, the encoding system controller 
200 resets the following VOB seamless connection flag 
VOB__Fsf indicating whether there is a seamless con- 
nection with the following scene. The procedure then 
moves to step #900. 

[0352] However, if step #600 returns YES, i.e., there 
is a seamless connection to the following scene, the pro- 
cedure moves to step #800. 

[0353] At step #800 the encoding system controller 
200 sets the following VOB seamless connection flag 
VOB_Fsf . The procedure then moves to step #900. 
[0354] At step #900 the encoding system controller 
200 determines whether there is more than connection 
target scene, i.e., whether a multi-scene period is se- 
lected, based on the scenario data St7. As previously 
described, there are two possible control methods in 
multi-scene periods: parental lock control whereby only 
one of plural possible reproduction paths that can be 
constructed from the scenes in the multi-scene period 
is reproduced, and multi-angle control whereby the re- 
production path can be switched within the multi-scene 
period to present different viewing angles. 
[0355] If step #900 returns NO, i.e., there are not mul- 
tiple scenes, the procedure moves to step #1 000. 
[0356] At step #1000 the multi-scene flag VOB_Fp 
identifying whether the VCB set comprises plural video 
objects VOB (a multi-scene period is selected) is reset, 
and the procedure moves to step #1 800 for encode pa- 
rameter production. This encode parameter production 
subroutine is described below. 

[0357] However, if step #900 returns YES, there is a 
multi-scene connection, the procedure moves to step 
#1100. 

[0358] At step #1100, the multi-scene flag VOB_Jp is 
set, and the procedure moves to step #1200 whereat it 
is judged whether a multi-angle connection is selected, 
or not. 

[0359] At step #1200 it is determined whether a 
change is made between plural scenes in the multi- 
scene period, i.e., whether a multi-angle scene period 
is selected. If step #1200 returns NO, i.e., no scene 



change is allowed in the multi-scene period as parental 
lock control reproducing only one reproduction path has 
been selected, the procedure moves to step #1300. 
[0360] At step #1300 the multi-angle flag VOB_Fm 
5 identifying whether the target connection scene is a mul- 
ti-angle scene is reset, and the procedure moves to step 
#1302. 

[0361] At step #1302 it is determined whether either 
the preceding VOB seamless connection flag VOB_Fsb 

10 or following VOB seamless connection flag VOB_Fsf is 
set. If step #1 302 returns YES , i.e. , the target connection 
scene seamlessly connects to the preceding, the follow- 
ing, or both the preceding and following scenes, the pro- 
cedure moves to step #1304. 

15 [0362] At step #1 304 the interleave flag VOB_.Fi iden- 
tifying whether the VOB, the encoded data of the target 
scene, is interleaved is set. The procedure then moves 
to step #1800. 

[0363] However, if step #1302 returns NO, i.e., thetar- 
20 get connection scene does not seamlessly connect to 
the preceding or following scene, the procedure moves 
to step #1306. 

[0364] At step #1 306 the interleave flag VC3_Fi is re- 
set, and the procedure moves to step #1800. 
25 [0365] If step #1200 returns YES, however, i. e., there 
is a multi-angle connection, the procedure moves to 
step #1400. 

[0366] At step #1400, the multi-angle flag VOB_Fm 
and interleave flag VOB_Fi are set, and the procedure 

30 moves to step #1 500. 

[0367] At step #1 500 the encoding system controller 
200 determines whether the audio and video can be 
seamlessly switched in a multi-angle scene period, i.e., 
at a reproduction unit smaller than the VOB, based on 

35 the scenario data St7. If step #1500 returns NO, i.e., 
non-seamless switching occurs, the procedure moves 
to step #1600. 

[0368] At step #1600 the multi-angle seamless 
switching flag VOB_FsV indicating whether angle 
40 changes within the multi-angle scene period are seam- 
less or not is reset, and the procedure moves to step 
#1800. 

[0369] However, if step #1500 returns YES, i.e., 
seamless switching occurs, the procedure moves to 
45 step #1700. 

[0370] At step #1700 the multi-angle seamless 
switching flag VOB_FsV is set, and the procedure 
moves to step #1800. 

[0371] Therefore, as shown by the flow chart in Fig. 

50 51, encode parameter production (step #1800) is only 
begun after the editing information is detected from the 
above flag settings in the scenario data St7 reflecting 
the user-defined editing instructions. 
[0372] Based on the user-defined editing instructions 

55 detected from the above flag settings in the scenario da- 
ta St7, information is added to the encoding information 
tables for the VOB Set units and VOB units as shown in 
Figs. 27 and 28 to encode the source streams, and the 
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encoding parameters of the VOB data units shown in 
Fig. 29 are produced, in step #1 800. The procedure then 
moves to step #1 900 for audio and video encoding. 
[0373] The encode parameter production steps (step 
#1 800) are described in greater detail below referring to 5 
Figs. 52, 53, 54, and 55. 

[0374] Based on the encode parameters produced in 
step #1 800, the video data and audio data are encoded 
in step #1 900, and the procedure moves to step #2000. 
[0375] Note that the sub-picture data is normally in- 
serted during video reproduction on an as-needed ba- 
sis, and contiguity with the preceding and following 
scenes is therefore not usually necessary. Moreover, 
the sub-picture data is normally video information for 
one frame, and unlike audio and video data having an 
extended time-base, sub-picture data is usually static, 
and is not normally presented continuously. Because 
the present invention relates specifically to seamless 
and non-seamless contiguous reproduction as de- 
scribed above, description of sub-picture data encoding 
is omitted herein for simplicity. 

[0376] Step #2000 is the last step in a loop comprising 
steps #300 to step #2000, and causes this loop to be 
repeated as many times as there are VOB Sets. This 
loop formats the program chain VTS_PGC#i to contain 
the reproduction sequence and other reproduction in- 
formation for each VOB in the title (Fig. 16) in the pro- 
gram chain data structure, interleaves the VOB in the 
multi-scene periods, and completes the VOB Set data 
stream and VOB data stream needed for system stream 
encoding. The procedure then moves to step #2100. 
[0377] At step #21 00 the VOB Set data stream is com- 
pleted as the encoding information table by adding the 
total number of VOB Sets VOBSJvJUM obtained as a 
result of the loop through step #2000 to the VCB Set 
data stream, and setting the number of titles TITLE_NO 
defining the number of scenario reproduction paths in 
the scenario data St7. The procedure then moves to 
step #2200. 

[0378] System stream encoding producing the VOB 
(VOB#i) data in the VTS title VOBS (VTSTTVOBS) 
(Fig. 16) is accomplished in step #2200 based on the 
encoded video stream and encoded audio stream out- 
put from step #1 900, and the encode parameters in Fig. 
29. The procedure then moves to step #2300. 
[0379] At step #2300 the VTS information VTSI, VTSI 
management table VTSI_M AT, VTSPGC information ta- 
ble VTS_PGCIT, and the program chain information 
VTS_PGCI#i controlling the VOB data reproduction se- 
quence shown in Fig. 16 are produced, and formatting 
to, for example, interleave the VOB contained in the mul- 
ti-scene periods, is accomplished. The specific steps 
executed in this formatting operation are described be- 
low with reference to Figs. 49, 50, 51 , 52, and 53. 
[0380] The encode parameter production subroutine 
shown as step #1 800 in Fig. 34B is described next using 
Figs. 52, 53, and 54 using by way of example the oper- 
ation generating the encode parameters for multi-angle 



control. 

[0381] Starting from Fig. 35, the process for generat- 
ing the encode parameters of a non-seamless switching 
stream with multi-angle control is described first. This 
stream is generated when step #1 500 in Fig. 34 returns 
NO and the following flags are set as shown: VOB_Fsb 
= 1 or VOB_Fsf = 1 , VOB_Fp = 1 , VOB_Fi = 1 , VOB_Fm 
= 1 , and VOB_FsV = 0. The following operation produc- 
es the encoding information tables shown in Fig. 27 and 
Fig. 28, and the encode parameters shown in Fig. 29. 
[0382] At step #1812, the scenario reproduction se- 
quence (path) contained in the scenario data St7 is ex- 
tracted, the VOB Set number VOBS JMO is set, and the 
VOB number VOB_NO is set for one or more VOB in 
the VOB Set. 

[0383] At step #1 81 4 the maximum bit rate ILV_BR of 
the interleaved VOB is extracted from the scenario data 
St7, and the maximum video encode bit rate V_MRATE 
from the encode parameters is set based on the inter- 
leave flag VOB_Fi setting (= 1). 
[0384] At step #1816, the minimum interleaved unit 
presentation time ILVLLMT is extracted from the sce- 
nario data St7. 

[0385] At step #1818, the video encode GOP struc- 
ture GOPST values N = 15 and M = 3 are set, and the 
GOP structure fixing flag GOP_Fxflag is set (= 1), based 
on the multi-scene flag VOB__Fp setting (= 1). 
[0386] Step #1 820 is the common VOB data setting 
routine, which is described below referring to the flow 
chart in Fig. 36. This common VOB data setting routine 
produces the encoding information tables shown in 
Figs. 27 and 28, and the encode parameters shown in 
Fig. 29. 

[0387] At step #1822 the video material start time 
VOB_VST and video material end time VOB_VEND are 
extracted for each VOB, and the video encode start time 
V_STTM and video encode end time V_ENDTM are 
used as video encoding parameters. 
[0388] At step #1824 the audio material start time 
VOB_AST of each VOB is extracted from the scenario 
data St7, and the audio encode start time A_STTM is 
set as an audio encoding parameter. 
[0389] At step #1826 the audio material end time 
VOB_AEND is extracted for each VOB from the scenar- 
io data St7, and at a time not exceeding the VOB_AEND 
time. This time extracted at an audio access unit (AAU) 
is set as the audio encode end time A_ENDTM which is 
an audio encoding parameter. Note that the audio ac- 
cess unit AAU is determined by the audio encoding 
method. 

[0390] At step #1828 the audio start gap A_STGAP 
obtained from the difference between the video encode 
start time V_STTM and the audio encode start time 
A_STTM is defined as a system encode parameter. 
[0391 ] At step #1 830 the audio end gap A_ENDGAP 
obtained from the difference between the video encode 
end time V_ENDTM and the audio encode end time 
A_ENDTM is defined as a system encode parameter. 
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[0392] At step #1832 the video encoding bit rate 
V_BR is extracted from the scenario data St7, and the 
video encode bit rate V_RATE, which is the average bit 
rate of video encoding, is set as a video encoding pa- 
rameter. 

[0393] At step #1834 the audio encoding bit rate 
A_BR is extracted from the scenario data St7, and the 
audio encode bit rate A_RATE is set as an audio encod- 
ing parameter. 

[0394] At step #1836 the video material type 
VOB_V_KIND is extracted from the scenario data St7. 
If the material is a film type, i.e., a movie converted to 
television broadcast format (so-called telecine conver- 
sion), reverse telecine conversion is set for the video 
encode mode V_ENCMD, and defined as a video en- 
coding parameter. 

[0395] At step #1838 the audio coding method 
VOB__A_KIND is extracted from the scenario data St7, 
and the encoding method is set as the audio encode 
method A_ENCMD and set as an audio encoding pa- 
rameter. 

[0396] At step #1840 the initial video encode data 
VJNTST sets the initial value of the VBV buffer to a val- 
ue less than the VBV buffer end value set by the last 
video encode data V_ENDST, and defined as a video 
encoding parameter. 

[0397] At step #1842 the VOB number VOBJMO of 
the preceding connection is set to the preceding VOB 
number B_VOB_NO based on the setting (= 1) of the 
preceding VOB seamless connection flag VOB_Fsb, 
and set as a system encode parameter. 
[0398] At step #1844 the VOB number VOBJMO of 
the following connection is set to the following VOB 
number F_VOB_NO based on the setting (= 1) of the 
following VOB seamless connection flag VOB_Fsf , and 
set as a system encode parameter. 
[0399] The encoding information table and encode 
parameters are thus generated for a multi-angle VOB 
Set with non-seamless multi-angle switching control en- 
abled. 

[0400] The process for generating the encode param- 
eters of a seamless switching stream with multi-angle 
control is described below with reference to Fig. 37. This 
stream is generated when step #1500 in Fig. 34 returns 
YES and the following flags are set as shown: VOB_Fsb 
= 1 or VOB.Fsf = 1 , VOB_Fp = 1 , VOB JF\ = 1 , VOB_Fm 
= 1 , and VOB_FsV = 1 . The following operation produc- 
es the encoding information tables shown in Fig. 27 and 
Fig. 28, and the encode parameters shown in Fig. 29. 
[0401] The following operation produces the encod- 
ing information tables shown in Fig. 27 and Fig. 28, and 
the encode parameters shown in Fig. 29. 
[0402] At step #1850, the scenario reproduction se- 
quence (path) contained in the scenario data St7 is ex- 
tracted, the VOB Set number VOBS_NO is set, and the 
VOB number VOB_NO is set for one or more VOB in 
the VOB Set. 

[0403] At step #1 852 the maximum bit rate ILV_BR of 



the interleaved VOB is extracted from the scenario data 
St7, and the maximum video encode bit rate V_MRATE 
from the encode parameters is set based on the inter- 
leave flag VOB_Fi setting (= 1). 
5 [0404] At step #1854, the minimum interleaved unit 
presentation time ILVUJMT is extracted from the sce- 
nario data St7. 

[0405] At step #1856, the video encode GOP struc- 
ture GOPST values N = 15 and M = 3 are set, and the 

10 GO P structure fixing flag GOP_Fxf lag is set (= 1 ), based 
on the multi-scene flag VOB_Fp setting (= 1). 
[0406] At step #1 858, the video encode GOP GOPST 
is set to "closed GOP" based on the multi-angle seam- 
less switching flag VOB_FsV setting (= 1 ), and the video 

15 encoding parameters are thus defined. 

[0407] Step #1 860 is the common VOB data setting 
routine, which is as described referring to the flow chart 
in Fig. 35. Further description thereof is thus omitted 
here. 

20 [0408] The encode parameters of a seamless switch- 
ing stream with multi-angle control are thus defined for 
a VOB Set with multi-angle control as described above. 
[0409] The process for generating the encode param- 
eters for a system stream in which parental lock control 

25 js implemented is described below with reference to Fig. 
38. This stream is generated when step #1200 in Fig. 
34 returns NO and step #1304 returns YES, i.e., the fol- 
lowing flags are set as shown : VOB_Fsb = 1 or VOB_Fsf 
= 1 , VOB_Fp = 1 , VOB_Fi = 1 , VOB_Fm = 0. The fol- 

30 lowing operation produces the encoding information ta- 
bles shown in Fig. 27 and Fig. 28, and the encode pa- 
rameters shown in Fig, 29. 

[0410] At step #1870, the scenario reproduction se- 
quence (path) contained in the scenario data St7 is ex- 
35 tracted, the VOB Set number VOBS JMO is set, and the 
VOB number VOBJMO is set for one or more VOB in 
the VOB Set. 

[041 1 ] At step #1 872 the maximum bit rate ILV_BR of 
the interleaved VOB is extracted from the scenario data 

40 st7, and the maximum video encode bit rate V_MRATE 
from the encode parameters is set based on the inter- 
leave flag VOB_Fi setting (= 1). 
[0412] At step #1872 the number of interleaved VOB 
divisions ILV_DIV is extracted from the scenario data 

45 St7. 

[0413] Step #1876 is the common VOB data setting 
routine, which is as described referring to the flow chart 
in Fig. 35. Further description thereof is thus omitted 
here. 

50 [0414] The encode parameters of a system stream in 
which parental lock control is implemented are thus de- 
fined for a VOB Set with multi-scene selection control 
enabled as described above. 

[041 5] The process for generating the encode param- 
55 eters for a system stream containing a single scene is 
described below with reference to Fig. 70. This stream 
is generated when step #900 in Fig. 34 returns NO, i.e., 
when VOB_Fp=0. The following operation produces the 
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encoding information tables shown in Fig. 27 and Fig. 
28, and the encode parameters shown in Fig. 29. 
[0416] At step #1880, the scenario reproduction se- 
quence (path) contained in the scenario data St7 is ex- 
tracted, the VOB Set number VOBS_NO is set, and the 
VOB number VOB_NO is set for one or more VOB in 
the VOB Set. 

[0417] At step #1 882 the maximum bit rate ILV_BR of 
the interleaved VOB is extracted from the scenario data 
St7, and the maximum video encode bit rate V_M RATE 
from the encode parameters is set based on the inter- 
leave flag VOB_Fi setting (= 1). 
[0418] Step #1 884 is the common VOB data setting 
routine, which is as described referring to the flow chart 
in Fig. 35. Further description thereof is thus omitted 
here. 

[0419] These flow charts for defining the encoding in- 
formation table and encode parameters thus generate 
the parameters for DVD video, audio, and system 
stream encoding by the DVD formatter. 

Formatter flows 

[0420] The operation of the subroutine executed by 
the DVD formatter shown as step #2300 in Fig. 34B is 
described next with reference to Figs. 49, 59, 51, 52, 
and 53. This formatter subroutine generates the DVD 
multimedia bitstream. 

[0421] The operation of the DVD encoder ECD 11 00 
according to the present invention is described with ref- 
erence to the flow chart in Fig. 49. Note that those steps 
shown in Fig. 49 with a double line are subroutines. 
[0422] At step #2310 the program chain information 
VTS_PGCI is set to the VTSI management table 
VTSLMAT for the number of titles TITLE_NUM based 
on the number of titles TITLE JMUM in the VOB Set data 
stream. 

[0423] At step #2312 it is determined whether multi- 
scene selection control is enabled based on the multi- 
scene flag VOBjFp in the VOB Set data stream. If step 
#2312 returns NO, i.e., multi-scene control is not ena- 
bled, the procedure moves to step #2114. 
[0424] At step #231 4 the operation for coding a single 
scene (VOB) executed by the formatter 1 1 00 of the au- 
thoring encoder EC shown in Fig. 25 is accomplished. 
This routine is described later. 

[0425] If step #2312 returns YES, i.e., multi-scene 
control is enabled, the procedure moves to step #2116. 
[0426] At step #231 6 it is determined whether the in- 
formation is to be interleaved or not based on the inter- 
leave flag VOB_Fi state in the VOB Set data stream. If 
step #2316 returns NO, i.e., the information is not to be 
interleaved, the procedure moves to step #2314. If step 
#2316 returns YES, i.e., the information is to be inter- 
leaved, the procedure moves to step #2318. 
[0427] At step #2318 it is determined whether multi- 
angle control is to be implemented based on the multi- 
angle flag VOB_Fm in the VOB Set data stream. If step 



#2318 returns NO, the parental lock control routine in 
step #2320 is executed. If step #2318 returns YES, the 
procedure moves to step #2322. 
[0428] At step #2320 the operation for formatting the 

5 VOB Set for parental lock control is executed. This sub- 
routine is shown in Fig. 52 and described below. 
[0429] At step #2322 it is determined whether multi- 
angle seamless switching is required based on the mul- 
ti-angle seamless switching flag VOB_FsV. If multi-an- 

10 gle switching is accomplished without seamless switch- 
ing, i.e., with non-seamless switching and step #2322 
returns NO, the procedure moves to step #2326. 
[0430] The multi-angle non-seamless switching con- 
trol routine executed in step #2326 by the formatter 1 1 00 

'5 of the authoring encoder EC in Fig. 25 is described later 
with reference to Fig. 50. 

[0431] If multi-angle switching is accomplished with 
seamless switching control, i.e., step #2322 returns 
YES, the procedure moves to step #2324. 
20 [0432] The multi-angle seamless switching control 
routine executed in step #2324 by the formatter 1100 of 
the authoring encoder EC in Fig. 25 is described later 
with reference to Fig. 51 . 

[0433] The cell playback information (PCG informa- 
25 tion entries C_PBI) of the VTS information VTSI set as 
previously described is then recorded. 
[0434] At step #2330 it is determined whether all VOB 
Sets declared by the VOB Set number VOBS_NUM 
have been processed by the formatter. If NO, control 
30 loops back to step #2312, and the process runs again. 
If YES, all sets have been formatted, the procedure ter- 
minates. 

[0435] Referring to Fig. 50, the multi-angle non-seam- 
less switching control routine executed in step #2326 

35 when step #2322, Fig. 49, returns NO is described. This 
routine defines the interleaved arrangement of the mul- 
timedia bitstream MBS, the content of the cell playback 
information (C_PBI#i) shown in Fig. 16, and the infor- 
mation stored to the navigation pack NV shown in Fig. 

40 20, in the generated DVD multimedia bitstream MBS. 
[0436] At step #2340 based on the multi-angle flag 
VOB_Fm setting (= 1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell block 
mode CBM (Fig. 16) of the cell playback information 

45 blocks C_PBI #i containing the VOB control information 
for each scene is declared according to the position of 
the angle data. For example, the cell block mode C3M 
of the MA1 cell (Fig. 23) is declared as 01 b to indicate 
the beginning of the cell block, the CBM of MA2 is de- 

50 dared as 1 0b to indicate a cell between the first and last 
cells in the block, and the CBM of MA3 is declared as 
11b to indicate the end of the cell block. 
[0437] At step #2342 based on the multi-angle flag 
VOB_Fm setting (= 1) declaring whether multi-angle 

55 control is applied in the multi-scene period, the cell block 
type CBT (Fig. 16) of the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene is declared as 01 b to indicate an "angle." 
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[0438] At step #2344 the seamless playback flag SPF 
(Fig. 16)issetto1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for 
each scene based on the preceding VOB seamless con- 
nection flag VOB_Fsb state, which is set to 1 to indicate 
a seamless connection. 

[0439] At step #2346 the STC resetting flag STCDF 
is set to 1 in the cell playback information blocks C_PB! 
#i containing the VOB control information for each scene 
based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 to indicate a seamless 
connection. 

[0440] At step #2348 the interleaved allocation flag 
IAF (Fig. 1 6) is set to 1 in the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene based on the multi-angle seamless 
switching flag V03_FsV state, which is set to 1 to indi- 
cate interleaving is required. 

[0441] At step #2350 the. location of the navigation 
pack NV (relative sector number from the VOB begin- 
ning) is detected from the title editing unit (VOB below) 
obtained from the system encoder 900 in Fig. 25, the 
navigation pack NV is detected based on the minimum 
interleaved unic presentation time ILVU_MT information 
(a formatter parameter obtained in step #1816, Fig. 35), 
the location of the VOBU expressed as the number of 
sectors from the VOB beginning, for example, is thus 
obtained, and the title editing unit VOB is divided into 
interleave units using VOBU units. 
[0442] For example, if in this example the minimum 
interleaved unit presentation time ILVU_MT is 2 sec and 
the presentation time of one VOBU is 0.5 sec, then the 
VOB is divided into interleave units of 4 VOBU each. 
Note that this allocation operation is applied to the VOB 
constituting each multi-scene data unit. 
[0443] At step #2352 the interleave u nits of each VOB 
obtained from step #2350 are arranged in the cell block 
mode CBM sequence (cell block beginning, middle, and 
end cells) written as the VOB control information for 
each scene in step #2340 to form the interleaved blocks 
as shown in Fig. 71 or 72. The interleaved blocks are 
then added to the VTS title VOBS (VTSTT_VOBS). Us- 
ing the cell block mode CBM declarations above, for ex- 
ample, the angle data MA1 , MA2, and MA3 (Fig. 23) are 
arranged in that sequence. 

[0444] At step #2354 the relative sector number from 
the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each 
VOBU based on the VOBU position information ob- 
tained in step #2350. 

[0445] At step #2356 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the n umber of sectors from 
the beginning of the VTS title VOBS (VTSTT_VOBS) 
are written as the addresses of the navigation packs NV 
of the first and last VOBU in each cell based on the VTS 
title VOBS (VTSTT_VOBS) data obtained in step 
#2352. 



[0446] The angle #i VOBU start address 
NSML_AGL_C1JDSTA - NSML_AGL_C9_DSTA of the 
non-seamless angle information NSML_AGLI (Fig. 20) 
in the navigation pack NV of each VOBU is written at 

5 step #2358. This address is expressed as the relative 
sector number inside the data of the interleaved blocks 
formed in step #2352, and declares the address infor- 
mation (Fig. 50) of the navigation pack NV contained in 
the VOBU of all angle scenes near the presentation start 

10 time of the VOBU being processed. 

[0447] At step #2360 VFFFFFFFh" is written to the 
angle #i VOBU start address N S M L_AG L_C 1 _DSTA - 
NSML_AGL_C9_DSTA of the non-seamless angle in- 
formation NSML_AGLI (Fig. 20) in the navigation pack 

15 NV of each VOBU if the VOBU being processed is the 
last VOBU of each scene in the multi-scene period. 
[0448] This routine thus formats the interleaved 
blocks for multi-angle non-seamless switching control 
in the multi-scene period, and formats the cell control 

20 information as the reproduction control information for 
those multiple scenes. 

[0449] Referring to Fig. 51 , the multi-angle seamless 
switching control routine executed in step #2324 when 
step #2322, Fig. 49, returns YES is described. This rou- 

25 tine defines the interleaved arrangement of the multime- 
dia bitstream MBS, the content of the cell playback in- 
formation (C_PBl#i) shown in Fig. 16, and the informa- 
tion stored to the navigation pack NV shown in Fig. 20, 
in the generated DVD multimedia bitstream MBS. 

30 [0450] At step #2370 based on the multi-angle flag 
VOB_Fm setting (= 1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell block 
mode CBM (Fig. 16) of the cell playback information 
blocks C_PBI #i containing the VOB control information 

35 for each scene is declared according to the position of 
the angle data. For example, the cell block mode CBM 
of the MA1 cell (Fig. 23) is declared as 01b to indicate 
the beginning of the cell block, the CBM of MA2 is de- 
clared as 1 0b to indicate a cell between the first and last 

40 cells in the block, and the CBM of MA3 is declared as 
11b to indicate the end of the cell block. 
[0451] At step #2372 based on the multi-angle flag 
VOB_Fm setting (= 1) declaring whether multi-angle 
control is applied in the multi-scene period, the cell block 

45 type CBT (Fig. 16) of the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene is declared as 01 b to indicate an "angle." 
[0452] At step #2374 the seamless playback flag SPF 
(Fig. 1 6) is set to 1 in the cell playback information blocks 

50 C_PBI #i containing the VOB control information for 
each scene based on the preceding VOB seamless con- 
nection flag VOB_Fsb state, which is set to 1 to indicate 
a seamless connection. 

[0453] At step #2376 the STC resetting flag STCDF 
55 js set to 1 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene 
based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 to indicate a seamless 
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connection. 

[0454] At step #2378 the interleaved allocation flag 
IAF (Fig. 16) is set to 1 in the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene based on the multi-angle seamless 
switching flag VOB_FsV state, which is set to 1 to indi- 
cate interleaving is required. 

[0455] At step #2380 the location of the navigation 
pack NV (relative sector number from the VOB begin- 
ning) is detected from the title editing unit (VOB below) 
obtained from the system encoder 900 in Fig. 25, the 
navigation pack NV is detected based on the minimum 
interleaved unit presentation time ILVU_MT information 
(a formatter parameter obtained in step #1 854, Fig. 37), 
the location of the VOBU expressed as the number of 
sectors from the VOB beginning, for example, is thus 
obtained, and the title editing unit VCB is divided into 
interleave units using VOBU units. 
[0456] For example, if in this example the minimum 
interleaved unit presentation time ILVU_MT is 2 sec and 
the presentation time of one VOBU is 0.5 sec, then the 
VOB is divided into interleave units of 4 VOBU each. 
Note that this allocation operation is applied to the VOB 
constituting each multi-scene data unit. 
[0457] At step #2382 the interleave units of each VOB 
obtained from step #2380 are arranged in the cell block 
mode CBM sequence (cell block beginning, middle, and 
end cells) written as the VOB control information for 
each scene in step #2360 to form the interleaved blocks 
as shown in Fig. 71 or 72. The interleaved blocks are 
then added to the VTS title VOBS (VTSTT_VOBS) . Us- 
ing the cell block mode CBM declarations above, for ex- 
ample, the angle data MA1 , MA2, and MA3 (Fig. 23) are 
arranged in that sequence. 

[0458] At step #2384 the relative sector number from 
the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each 
VOBU based on the VOBU position information ob- 
tained in step #2360. 

[0459] At step #2386 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from 
che beginning of the VTS title VOBS (VTSTT_VOBS) 
are written as the addresses of the navigation packs NV 
of the first and last VOBU in each cell based on the VTS 
title VOBS (VTSTT_VOBS) data obtained in step 
#2382. 

[0460] At step #2388 the relative sector number from 
the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each 
VOBU based on the interleave unit data obtained in step 
#2370. 

[0461] The angle #i VOBU start address 
SML_AGL_C1_DSTA - SML_AGL_C9_DSTA of the 
seamless angle information SML_AGLI (Fig. 20) in the 
navigation pack NV of each VOBU is written at step 
#2390. This address is expressed as the relative sector 
number inside the data of the interleaved blocks formed 



in step #2382, and declares the address information of 
the navigation pack NV contained in the VOBU of all an- 
gle scenes with a start time contiguous to the reproduc- 
tion end time of the VOBU being processed. 

5 [0462] At step #2392 M 7FFFFFFFh M is written to the 
angle #i VOBU start address SML_AGL_C1 _DSTA - 
SML_AGL_C9_DSTA of the seamless angle informa- 
tion SML_AGLI (Fig. 20) in the navigation pack NV of 
the VOBU contained in the interleaved unit if the inter- 

10 leave unit arranged in step #2382 is the last interleave 
unit of each scene in the multi-scene period. 
[0463] This routine thus formats the interleaved 
blocks for multi-angle seamless switching control in the 
multi-scene period, and formats the cell control informa- 

15 tion as the reproduction control information for those 
multiple scenes. 

[0464] The parental lock subroutine (step #2320, Fig. 
49) executed when step #231 8 in Fig. 49 returns NO, i. 
e., when it is determined that parental lock control is im- 
20 plemented and not multi-angle control, is described next 
with reference to Fig. 52. 

[0465] The parental lock subroutine described below 
writes the interleave unit arrangement of the multimedia 
bitstream, the content of the PGC information entries 

25 C_PBI #i (cell playback information) shown in Fig. 16, 
and the navigation pack NV information shown in Fig. 
20, to the generated DVD multimedia bitstream. 
[0466] At step #2402 a value "00b" is written to the 
cell block mode CBM (Fig. 16) of the cell playback in- 

30 formation blocks C_PBI #i containing the VOB control 
information for each scene based on the multi-angle flag 
VOB_Fm state, which is set to 0 to indicate that multi- 
angle control is not enabled in the multi-scene period. 
[0467] At step #2404 the seamless playback flag SPF 

35 (Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for 
each scene based on the preceding VOB seamless con- 
nection flag VOB_Fsb state, which is set to 1 to indicate 
a seamless connection. 

40 [0468] At step #2406 the STC resetting flag STCDF 
is set to 1 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene 
based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 to indicate a seamless 

45 connection. 

[0469] At step #2408 the interleaved allocation flag 
IAF (Fig. 16) is set to 1 in the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene based cn the multi-angle seamless 

50 switching flag VOB_FsV state, which is set to 1 to indi- 
cate interleaving is required. 

[0470] At step #241 0 the navigation pack NV position 
information (the relative sector number from the VOB 
start) is detected from the title editing unit (VOB) ob- 
55 tained from the system encoder 900 (Fig. 25). The nav- 
igation pack NV is then detected based on the number 
of interleaved VOB divisions ILVJ3IV, a formatter pa- 
rameter obtained in step #1874 in Fig. 38, to obtain the 
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VOBU position information (number of sectors from the 
VOB start), and divide each VOB into the specified 
number of interleave units in VOBU units. 
[0471] At step #2412 the interleave units obtained in 
step #241 0 are then interleaved. For example, the inter- 5 
leave units are arranged in ascending VOB number se- 
quence to create the interleaved blocks as shown in Fig. 
71 or 72, and the interleaved blocks are added to the 
VTS title VOBS (VTS7T.VOBS). 
[0472] At step #241 4 the relative sector number from 
the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each 
VOBU based on the VOBU position information ob- 
tained in step #2186. 

[0473] At step #241 6 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the number of sectors from 
the beginning of the VTS title VOBS (VTSTT_VOBS) 
are written as the addresses of the navigation packs NV 
of the first and last VOBU in each cell based on the VTS 
title VOBS (VTSTT_VOBS) data obtained in step 
#2412. 

[0474] At step #241 8 the relative sector number to the 
last interleave unit pack is written to the ILVU end pack 
address ILVU_EA in the navigation pack NV of the VO- 
BU forming the interleaved units based on the inter- 
leaved unit data obtained from step #2412. 
[0475] At step #2420, the relative sector number in the 
interleaved block data formed in step #2412 is written 
to the next-ILVU start address NTJLVU_SA as the po- 
sition information of the next ILVU in the navigation 
packs NV of the VOBU contained in the interleaved unit 
ILVU. 

[0476] At step #2422 the interleaved unit flag ILVU 
flag is set to 1 in the navigation packs NV of the VOBU 
contained in the interleaved unit ILVU. 
[0477] At step #2424, the Unit END flag of the navi- 
gation pack NV in the last VOBU of the interleaved unit 
ILVU is set to 1 . 

[0478] At step #2426 "FFFFFFFFh" is written to the 
next-ILVU start address NTJLVU_SA of the navigation 
pack NV of the VOBU in the last interleaved unit ILVU 
of each VOB. 

[0479] The operation described above thus formats 
the interleaved blocks to enable parental lock control in 
the multi-scene periods, and formats the control infor- 
mation in the cells, i .e., the cell playback control infor- 
mation for the multi-scene periods. 
[0480] The single scene subroutine executed as step 
#231 4 in Fig. 49 when steps #231 2 or #231 6 return NO, 
i.e. , when the scene is determined to be a single scene 
and not a multi-scene period, is described next using 
Fig. 53. 

[0481] The single scene subroutine described below 
writes the interleave unit arrangement of the multimedia 
bitstream, the content of the PGC information entries 
C_PBI #i (cell playback information) shown in Fig. 16, 
and the navigation pack NV information shown in Fig. 



20, to the generated DVD multimedia bitstream. 
[0482] At step #2430 a value "00b" indicating a "non- 
cell block", ie. , that there is only one cell in the functional 
block, is written to the cell block mode CBM (Fig. 16) of 
the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene based on 
the multi-scene flag VOB_Fp state, which is set to 0 to 
indicate that the scene is a single scene and not part of 
a multi-scene period. 

[0483] At step #2432 the interleaved allocation flag 
IAF (Fig. 16) is set to 0 in the cell playback information 
blocks C_PBI #i containing the VOB control information 
for each scene based on the multi-angle seamless 
switching flag VOB_FsV state, which is set to 0 to indi- 
cate interleaving is not required. 
[0484] At step #2434 the navigation pack NV position 
information (the relative sector number from the VOB 
start) is detected from the title editing unit (VOB) ob- 
tained from the system encoder 900 (Fig. 25), placed in 
the VOBU unit, and added to the VTS title VOBS 
(VTSTT_VOBS), the video and other stream data of the 
multimedia bitstream. 

[0485] At step #2436 the relative sector number from 
the VOBU start is written to the VOB end pack address 
VOBU_EA (Fig. 20) in the navigation pack NV of each 
VOBU based on the VOBU position information ob- 
tained in step #2434. 

[0486] At step #2438 the first cell VOBU start address 
C_FVOBU_SA and the last cell VOBU start address 
C_LVOBU_SA expressed as the n umber of sectors from 
the beginning of and the end of, respectively, the VTS 
title VOBS (VTSTT_VOBS) of the value written as the 
addresses of the navigation packs NV of the first and 
last VOBU in cell based on the VTS citle VOBS 
(VTSTT_VOBS) data obtained in step #2434. 
[0487] At step #2440 the state determined as a result 
of step #300 or #600 in Fig. 34, i.e., whether preceding 
VOB seamless connection flag VOB_Fsb is set to 1 in- 
dicating a seamless connection to the preceding or fol- 
lowing scenes, is evaluated. If step #2440 returns YES, 
the procedure moves to step #2442. 
[0488] At step #2442 the seamless playback flag SPF 
(Fig. 16) is set to 1 in the cell playback information blocks 
C_PBI #i containing the VOB control information for 
each scene based on the preceding VOB seamless con- 
nection flag VOB_Fsb state, which is set to 1 to indicate 
a seamless connection. 

[0489] At step #2444 the STC resetting flag STCDF 
is set to 1 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene 
based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 . 
[0490] If step #2440 returns NO, i.e., there is not a 
seamless connection to the preceding scene, the pro- 
cedure moves to step #2446. 

[0491] At step #2446 the seamless playback flag SPF 
(Fig. 1 6) is set to 0 in the cell playback information blocks 
C_PBI #i containing the VOB control information for 
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each scene based on the preceding VOB seamless con- 
nection flag VOB_Fsb state, which is set to 0 to indicate 
a non-seamless connection. 

[0492] At step #2448 the STC resetting flag STCDF 
is set to 0 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene 
based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 0. 
[0493] The operation described above thus formats a 
multimedia bitstream for a single scene period, and 
records the control information in the cells, i.e., the cell 
playback control information (C_PBI#i, Fig. 16), and the 
information in the navigation pack NV (Fig. 20), to the 
produced DVD multimedia bitstream. 

Decoder flow charts 

Disk-to-stream buffer transfer flow: 

[0494] The decoding information table produced by 
the decoding system controller 2300 based on the sce- 
nario selection data St51 is described below referring to 
Figs. 54 and 55. The decoding Information table com- 
prises the decoding system table shown in Fig. 54, and 
the decoding table shown in Fig. 55. 
[0495] As shown in Fig. 54, the decoding system table 
comprises a scenario information register and a cell in- 
formation register. The scenario information register 
records the title number and other scenario reproduction 
information selected by the user and extracted from the 
scenario selection data St51. The cell information reg- 
ister extracts and records the information required to re- 
produce the cells constituting the program chain PGC 
based on the user-defined scenario information extract- 
ed into the scenario information register. 
[0496] More specifically, the scenario information reg- 
ister contains plural sub-registers, i.e., the angle 
number ANGLE_NO_reg, VTS number VTS_NO_reg, 
PGC number VTS_PGCLNO_reg, audio ID AUDIO_ 
ID_reg, sub-picture ID SPJD^reg, and the system clock 
reference SCR buffer SCR_buffer. 
[0497] The angle number ANGLE_NO_reg stores 
which angle is reproduced when there are multiple an- 
gles in the reproduction program chain PGC. 
[0498] The VTS number VTS_NO_reg records the 
number of the next VTS reproduced from among the plu- 
ral VTS on the disk. 

[0499] The PGC number VTS_PGCI_NO_reg 
records which of the plural program chains PGC present 
in the video title set VTS is to be reproduced for parental 
lock control or other applications. 
[0500] The audio ID AUDIO_ID_reg records which of 
the plural audio streams in the VTS are to be repro- 
duced. 

[0501] The sub-picture ID SPJD__reg records which 
of the plural sub-picture streams is to be reproduced 
when there are plural sub-picture streams in the VTS. 
[0502] The system clock reference SCR buffer 



SCR_buffer is the buffer for temporarily storing the sys- 
tem clock reference SCR recorded to the pack header 
as shown in Fig. 1 9. As described using Fig. 26, this 
temporarily stored system clock reference SCR is out- 
s put to the decoding system controller 2300 as the bit- 
stream control data St63. 

[0503] The cell information register contains the fol- 
lowing sub-registers: the cell block mode CBM_reg, cell 
block type CBT_reg, seamless reproduction flag 
10 SPF_reg, interleaved allocation flag IAF_reg, STC re- 
setting flag STCDF, seamless angle change flag 
SACF_reg, first cell VOBU start address C_FVOBU_ 
SA_reg, and last cell VOBU start address C_LVOBlL 
SA_reg. 

15 [0504] The cell block mode CBM_reg stores a value 
indicating whether plural cells constitute one functional 
block. If there are not plural cells in one functional block, 
CBM_reg stores N_BLOCK. |f plural cells constitute one 
functional block, the value FJDELL is stored as the 

20 CBM_reg value of the first cell in the block, L_CELL is 
stored as the CBM_jeg value of the last cell in the block, 
and BLOCK is stored as the CBM_reg of value all cells 
between the first and last cells in the block. 
[0505] The cell block type CBT_reg stores a value de- 

25 fining the type of the block indicated by the cell block 
mode CBM_reg. If the cell block is a multi-angle block, 
A_BLOCK is stored; if not, N_BLOCK is stored. 
[0506] The seamless reproduction flag SPF_reg 
stores a value defining whether that cell is seamless 

30 connected with the cell or cell block reproduced there- 
before. If a seamless connection is specified, SML is 
stored; if a seamless connection is not specified, NSML 
is stored. 

[0507] The interleaved allocation flag IAF_reg stores 
35 a value identifying whether the ceil exists in a contiguous 
or interleaved block. If the cell is part of a an interleaved 
block, ILVB is stored; otherwise NJLVB is stored. 
[0508] The STC resetting flag STCDF defines wheth- 
er the system time clock STC used for synchronization 
40 must be reset when the cell is reproduced; when reset- 
ting the system time clock STC is necessary, 
STC_RESET is stored; if resetting is not necessary, 
STC_N RESET is stored. 

[0509] The seamless angle change flag SACF_reg 
45 stores a value indicating whether a cell in a multi-angle 
period should be connected seamlessly at an angle 
change. If the angle change is seamless, the seamless 
angle change flag SACF is set to SML; otherwise it is 
set to NSML 

50 [0510] The first cell VOBU start address 
C_FVOBU_SA_reg stores the VOBU start address of 
the first cell in a block. The value of this address is ex- 
pressed as the distance from the logic sector of the first 
cell in the VTS title VOBS (VTSTT_VOBS) as measured 

55 by and expressed (stored) as the number of sectors. 
[0511] The last cell VOBU start address C_ 
LVOBU_SA_reg stores the VOBU start address of the 
last cell in the block. The value of this address is also 



71 



EP 0 920 203 B1 



72 



expressed as the distance from the logic sector of the 
first cell in the VTS title VOBS (VTSTT_VOBS) meas- 
ured by and expressed (stored) as the number of sec- 
tors. 

[0512] The decoding table shown in Fig. 55 is de- 
scribed below. As shown in Fig. 55, the decoding table 
comprises the following registers: information registers 
for non-seamless multi-angle control, information regis- 
ters for seamless multi-angle control, a VOBU informa- 
tion register, and information registers for seamless re- 
production. 

[0513] The information registers for non-seamless 
multi-angle control comprise sub-registers NSML_ 
AGL_C1_DSTA_reg - N S M L_AG L_C9_D STA_reg . 
[0514] N S M L_AG L_C 1 _D STA_reg - NSML_AGL_ 
C9_DSTA_reg record the N M S L_AG L_C 1 _DSTA - 
NMSL_AGL_C9J3STA values in the PCI packet shown 
in Fig. 20. 

[0515] The information registers for seamless multi- 
angle control comprise sub-registers SML_AGL_C1__ 
DSTA_reg - SML_AGL_C9__DSTA_reg. 
[0516] SML_AGL_C1_DSTA_reg - SML_AGL_C9_ 
DSTA_reg record the SML_AGL_C1_DSTA - SML_ 
AGL_C9_DSTA values in the DSI packet shown in Fig. 
20. 

[0517] The VOBU information register stores the end 
pack address VOBU_EA in the DSI packet shown in Fig. 
20. 

[0518] The information registers for seamless repro- 
duction comprise the following sub-registers: an inter- 
leaved unit flag lLVU_flag_reg, Unit END flag 
UNIT_END_flag_reg, Interleaved Unit End Address 
ILVU_EA_reg, Next Interleaved Unit Start Address 
NTJLVU_SA_reg, the presentation start time of the first 
video frame in the VOB (Initial Video Frame Presenta- 
tion Start Time) VOB_V_SPTM_reg, the presentation 
end time of the last video frame in the VOB (Final Video 
Frame Presentation Termination Time) VOB_V_EPTM_ 
reg, audio reproduction stopping time 1 VOB_A_STP_ 
PTM1_reg, audio reproduction stopping time 2 
VOB_A_STP_PTM2_reg, audio reproduction stopping 
period 1 VOB_A_GAP_LEN1_reg, and audio reproduc- 
tion stopping period 2 VO B_A_G A P_L E N 2_re g . 
[0519] The interleaved unit flag ILVU_flag_reg stores 
the value indicating whether the video object unit VOBU 
is in an interleaved block, and stores ILVU if it is, and 
NJLVU if not. 

[0520] The Unit END flag UN IT-EN D_flag_reg stores 
the value indicating whether the video object unit VOBU 
is the last VOBU in the interleaved unit ILVU. Because 
the interleaved unit ILVU is the data unit for continuous 
reading, the UNIT_ENDJIag_reg stores END if the VO- 
BU currently being read is the last VOBU in the inter- 
leaved unit ILVU, and otherwise stores N_END. 
[0521] The Interleaved Unit End Address ILVU_EA_ 
reg stores the address of the last pack in the ILVU to 
which the VOBU belongs if the VOBU is in an inter- 
leaved block. This address is expressed as the number 



of sectors from the navigation pack NV of that VOBU. 
[0522] The Next Interleaved Unit Start Address 
NT_ILVU_SA_reg stores the start address of the next 
interleaved unit ILVU if the VOBU is in an interleaved 

5 block. This address is also expressed as the number of 
sectors from the navigation pack NV of that VOBU. 
[0523] The Initial Video Frame Presentation Start 
Time register VOB_V_SPTM_reg stores the time at 
which presentation of the first video frame in the VOB 

10 starts. 

[0524] The Final Video Frame Presentation Termina- 
tion Time register VOB_V_EPTM_reg stores the time at 
which presentation of the last video frame in the VOB 
ends. 

15 [0525] The audio reproduction stopping time 1 
VOB_A_STP_PTM1_reg stores the time at which the 
audio is to be paused to enable resynchronization, and 
the audio reproduction stopping period 1 VOB_ 
A_GAP_LEN1_reg stores the length of this pause peri- 

20 od. 

[0526] The audio reproduction stopping time 2 
VOB_A_STP_PTM2_reg and audio reproduction stop- 
ping period 2 VOB_A_GAP_LEN2„reg store the same 
values. 

25 [0527] The operation of the DVD decoder DCD ac- 
cording to the present invention as shown in Fig. 26 is 
described next below with reference to the flow chart in 
Fig. 56. 

[0528] At step #31 0202 it is first determined whether 
30 a disk has been inserted. If it has, the procedure moves 
to step #310204. 

[0529] At step #3 1 0204, the vol ume file structure VFS 
(Fig. 21) is read, and the procedure moves to step 
#310206. 

35 [0530] At step #310206, the video manager VMG 
(Fig. 21 ) is read and the video title set VTS to be repro- 
duced is extracted. The procedure then moves to step 
#310208. 

[0531] At step #310208, the video title set menu ad- 
40 dress information VTSM_C_ADT is extracted from the 
VTS information VTSI , and the procedure moves to step 
#310210. 

[0532] At step #310210 the video title set menu 
VTSM_VOBS is read from the disk based on the video 

45 title set menu address information VTSM_C_ADT, and 
the title selection menu is presented. 
[0533] The user is thus able to select the desired title 
from this menu instep #310212. If the titles include both 
contiguous titles with no user-selectable content, and 

50 titles containing audio numbers, sub-picture numbers, 
or multi-angle scene content, the user must also enter 
the desired angle number. Once the user selection is 
completed, the procedure moves to step #310214. 
[0534] At step #310214, the VTS_PGCI #i program 

55 chain (PGC) data block corresponding to the title 
number selected by the user is extracted from the VT- 
SPGC information table VTS_PGCIT, and the proce- 
dure moves to step #310216. 
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[0535] Reproduction of the program chain PGC then 
begins at step #310216. When program chain PGC re- 
production is finished, the decoding process ends. If a 
separate title is thereafter to be reproduced as deter- 
mined by monitoring key entry to the scenario selector, 
the title menu is presented again (step #310210). 
[0536] Program chain reproduction in step #310216 
above is described in further detail below referring to 
Fig. 57. The program chain PGC reproduction routine 
consists of steps #31 030, #31 032, #31 034, and #31 035 
as shown. 

[0537] At step #31030 the decoding system table 
shown in Fig. 54 is defined. The angle number 
ANGLE_NO_jeg, VTS number VTS_NO_reg, PGC 
number VTS_PGCI_NO_reg, audio ID AUDIOJD_reg, 
and sub-picture ID SP_ID_reg are set according to the 
selections made by the user using the scenario selector 
2100. 

[0538] Once the PGC to be reproduced is determined, 
the corresponding cell information (PGC information en- 
tries C_PB! #j) is extracted and the cell information reg- 
ister is defined. The sub-registers therein that are de- 
fined are the cell block mode CBM_reg, cell block type 
CBT_reg, seamless reproduction flag SPF_reg, inter- 
leaved allocation flag IAF_reg, STC resetting flag 
STCDF, seamless angle change flag SACF_reg, first 
cell VOBU start address C_FVOBU_SA__reg, and last 
cell VOBU start address C_LVOBU_SA_reg. 
[0539] Once the decoding system table is defined, the 
process transferring data to the stream buffer (step 
#31032) and the process decoding the data in the 
stream buffer (step #31034) are activated in parallel. 
[0540] The process transferring data to the stream 
buffer (step #31 032) is the process of transferring data 
from the recording medium M to the stream buffer 2400. 
This is, therefore, the processing of reading the required 
data from the recording medium M and inputting the da- 
ta to the stream buffer 2400 according to the user-se- 
lected title information and the playback control informa- 
tion (navigation packs NV) written in the stream. 
[0541] The routine shown as step #31 034 is the proc- 
ess for decoding the data stored to the stream buffer 
2400 (Fig. 26), and outputting the decoded data to the 
video data output terminal 3600 and audio data output 
terminal 3700. Thus, is the process for decoding and 
reproducing the data stored to the stream buffer 2400. 
[0542] Note that step #31 032 and step #31 034 are ex- 
ecuted in parallel. 

[0543] The processing unit of step #31 032 is the cell, 
and as processing one cell is completed, it is determined 
in step #31035 whether the complete program chain 
PGC has been processed. If processing the complete 
program chain PGC is not completed, the decoding sys- 
tem table is defined for the next cell in step #31 030. This 
loop from step #31 030 through step #31 035 is repeated 
until the entire program chain PGC is processed. 
[0544] The stream buffer data transfer process of step 
#31032 is described in further detail below referring to 



Fig. 70. The stream buffer data transfer process (step 
#31032) comprises steps #31040, #31042, #31044, 
#31046, and #31048 shown in the figure. 
[0545] At step #31040 it is determined whether the 
s cell is a multi-angle cell. If not, the procedure moves to 
step #30144. 

[0546] At step #31 044 the non-multi-angle cell decod- 
ing process is executed. 

[0547] However, if step #301 40 returns YES because 
10 the cell is a multi-angle cell , the procedure moves to step 
#30142 where the seamless angle change flag SACF is 
evaluated to determine whether seamless angle repro- 
duction is specified. 

[0548] If seamless angle reproduction is specified, the 
15 seamless multi-angle decoding process is executed in 
step #30146. If seamless angle reproduction is not 
specified, the non-seamless multi-angle decoding proc- 
ess is executed in step #30148. 
[0549] The non-multi-angle cell decoding process 
20 (step #31044, Fig. 62) is described further below with 
reference to Fig. 63. Note that the non-multi-angle cell 
decoding process (step #31044) comprises the steps 
#31050, #31052, and #31054. 

[0550] The first step #31 050 evaluates the interleaved 
25 allocation flag IAF_reg to determine whether the cell is 
in an interleaved block. If it is, the non-multi-angle inter- 
leaved block process is executed in step #31052. 
[0551 ] The non-multi-angle interleaved block process 
(step #31052) processes scene branching and connec- 
30 tion where seamless connections are specified in, for 
example, a multi-scene period. 
[0552] However, if the cell is not in an interleaved 
block, the non-multi-angle contiguous block process is 
executed in step #31054. Note that the step #31054 
35 process is the process executed when there is no scene 
branching or connection. 

[0553] The non-multi-angle interleaved block process 
(step #31052, Fig. 63) is described further below with 
reference to Fig. 64. 
40 [0554] At step #31060 the reading head 2006 is 
jumped to the first cell VOBU start address 
C_FVOBU_SA read from the C_FVOBU_SA_reg regis- 
ter. 

[0555] More specifically, the address data C_ 
45 FVOBU_SA_reg stored in the decoding system control- 
ler 2300 (Fig. 26) is input as bitstream reproduction con- 
trol signal St53 to the reproduction controller 2002. The 
reproduction controller 2002 thus controls the recording 
media drive unit 2004 and signal processor 2008 to 
so move the reading head 2006 to the specified address, 
data is read, error correction code ECC and other signal 
processing is accomplished by the signal processor 
2008, and the cell start VOBU data is output as the re- 
produced bitstream St61 to the stream buffer 2400. The 
55 procedure then moves to step #31062. 

[0556] At step #31 062 the DSI packet data in the nav- 
igation pack NV (Fig. 20) is extracted in the stream buff- 
er 2400, the decoding table is defined, and the proce- 
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dure moves to step #31 064. The registers set in the de- 
coding table are the ILVU_EA_reg, NTJLVU_SA_reg, 
VOB_V_SPTM_reg, VOB_V_EPTM_reg, VOB_A_ 
STP_PTM1_reg, VOB_A_STP__PTM2_reg, VOB_A_ 
GAP_LEN1_reg, and VOB_A_GAP_LEN2_reg. 
[0557] At step #31 064 the data from the first cell VO- 
BU start address C_FVOBU_SA_reg to the ILVU end 
pack address ILVU__EA_reg, i.e., the data for one inter- 
leaved unit ILVU, is transferred to the stream buffer 
2400. The procedure then moves to step #31066. 
[0558] More specifically, the address data ILVU_ 
EA_reg stored in the decoding system controller 2300 
(Fig. 26) is supplied to the reproduction controller 2002. 
The reproduction controller 2002 thus controls the re- 
cording media drive unit 2004 and signal processor 
2008 to read the data to the ILVU_EA_reg address, and 
after error correction code ECC and other signal 
processing is accomplished by the signal processor 
2008, the data for the first ILVU in the cell is output as 
the reproduced bitstream St61 to the stream buffer 
2400. It is thus possible to output the data for one con- 
tiguous interleaved unit ILVU on the recording medium 
M to the stream buffer 2400. 

[0559] At step #31 066 it is determined whether all in- 
terleaved units in the interleaved block have been read 
and transferred. If the interleaved unit ILVU processed 
is the last ILVU in the interleaved block, "0X7FFFFFFF*' 
indicating termination is set to the next-ILVU start ad- 
dress NTJLVU_SA_reg as the next read address. If all 
interleaved units in the interleaved block have thus been 
processed, the procedure moves to step #31068. 
[0560] At step #31 068 the reading head 2006 is again 
jumped to the address NTJLVU_SA_reg of the next in- 
terleave unit to be reproduced, and the procedure loops 
back to step #31 062. Note that this jump is also accom- 
plished as described above, and the loop from step 
#31062 to step #31068 is repeated. 
[0561] However, if step #31066 returns YES, i.e., all 
interleaved unit ILVU in the interleaved block have been 
transferred, step #31 052 terminates. 
[0562] The non-multi-angle interleaved block process 
(step #31052) thus transfers the data of one cell to the 
stream buffer 2400. 

[0563] The non-multi-angle contiguous block process 
is executed in step #31054, Fig. 63, is described further 
below with reference to Fig. 65. 

[0564] At step #31070 the reading head 2006 is 
jumped to the first cell VOBU start address C_ 
FVOBU_SA read from the C_FVOBU_SA_reg register. 
This jump is also accomplished as described above, and 
the loop from step #31072 to step #31076 is initiated. 
[0565] At step #31 072 the DSI packet data in the nav- 
igation pack NV (Fig. 20) is extracted in the stream buff- 
er 2400, the decoding table is defined, and the proce- 
dure moves to step #31 074. The registers set in the de- 
coding table are the VOBU_EA_reg, VOB_ 
V_SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_ 
PTM1_reg, VOB_A_STP_PTM2_reg, VOB_A_GAP_ 



LEN1 _reg, and VOB_A_GAP_LEN2_reg. 
[0566] At step #31 074 the data from the first cell VO- 
BU start address C_FVOBU_SA_reg to the end pack 
address VOBU_EA_reg, i.e., the data for one video ob- 

5 ject unit VOBU, is transferred to the stream buffer 2400. 
The procedure then moves to step #31 076. The data for 
one video object unit VOBU contiguously arrayed to the 
recording medium M can thus be transferred to the 
stream buffer 2400. 

10 [0567] At step #31 076 it is determined whether all cell 
data has been transferred. If all VOBU in the cell has 
not been transferred, the data for the next VOBU is read 
continuously, and the process loops back to step 
#31070. 

15 [0568] However, if all VOBU data in the cell has been 
transferred as determined by the C_LVOBU_SA_reg 
value in step #31076, the non-multi-angle contiguous 
block process (step #31054) terminates. This process 
thus transfers the data of one cell to the stream buffer 

20 2400. 

[0569] Another method of accomplishing the non- 
multi-angle cell decoding process (step #31 044, Fig. 62) 
is described below with reference to Fig. 64. 
[0570] At step #31080 the reading head 2006 is 
25 jumped to the first cell VOBU start address C_ 
FVOBU_SA_reg, and the first VOBU data in the cell is 
transferred to the stream buffer 2400. The procedure 
then moves to step #31 081 . 

[0571] At step #31 081 the DSI packet data in the nav- 
30 igation pack NV (Fig. 20) is extracted in the stream buff- 
er 2400, the decoding table is defined, and the proce- 
dure moves to step #31 082. The registers set in the de- 
coding table are the SCFLbuffer, VOBU_EA_reg, 
ILVU_flag_reg, UNIT_END_flag_reg, ILVU_EA_reg, 
35 NT_lLVU_SA_reg, VOB_V_SPTM_reg, VOB_V_ 
EPTM_reg, VOB_A_STP_PTM1_reg, VOB_A_STP_ 
PTM2_reg, VOB_A_GAP_LEN1_reg, and VOB_A_ 
GAP_LEN2_reg. 

[0572] At step #31 082 the data from the first cell VO- 
40 BU start address C_FVOBU_SA_reg to the end pack 
address VOBU_EA_reg, i.e., the data for one video ob- 
ject unit VOBU, is transferred to the stream buffer 2400. 
The procedure then moves to step #31083. 
[0573] At step #31 083 is determined whether all cell 
45 VOBU data has been transferred. If it has, the process 
(step #31044) terminates. If it has not, the procedure 
moves to step #31084. 

[0574] At step #31084 it is determined whether the 
VOBU is the last VOBU in the interleaved unit. If not, the 

so process loops back to step #31 081 . If so, the procedure 
advances to step #31085. It is thus possible to transfer 
one cell of data in VOBU units to the stream buffer 2400. 
[0575] The loop from step #31 081 to step #31 084 re- 
peats as described above. 

55 [0576] At step #31 085 it is determined whether the in- 
terleaved unit ILVU is the last in the interleaved block. 
If so, step #31044 terminates. If not, the procedure ad- 
vances to step #31 086. 



t 

77 



EP 0 920 203 B1 



78 



[0577] At step #31086 the reading head 2006 is 
jumped to the address NT_ILVU_SA_reg of the next in- 
terleave unit, and the procedure loops back to step 
#31081 . It is thus possible to transfer the data for one 
cell to the stream buffer 2400. 

[0578] The seamless multi-angle decoding process 
executed in step #31046, Fig. 62, is described below 
referring to Fig. 67. 

[0579] At step #31090 the reading head 2006 is 
jumped to the first cell VOBU start address 
C„FVOBU_SA read from the C_FVOBU_SA_reg regis- 
ter, and the first VOBU data in the cell is transferred to 
the stream buffer 2400. The procedure then moves to 
step #31091. This jump is also accomplished as de- 
scribed above, and the loop from step #31091 to step 
#31095 is initiated. 

[0580] At step #31 091 the DSI packet data in the nav- 
igation pack NV (Fig. 20) is extracted in the stream buff- 
er 2400, the decoding table is defined, and the proce- 
dure moves to step #31 092. The registers set in the de- 
coding table are the ILVU_EA_reg, SML_AGL_ 
C1JDSTA_reg - S M L_AG L_C9_D STA_reg , VOB_V__ 
SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_ 
PTM1_reg, VOB_A_STP_PTM2_reg, VOB_A„GAP_ 
LEN1_reg, and VO B_A_G A P_L EN2_reg . 
[0581 ] At step #31 092 the data from the first cell VO- 
BU start address C_FVOBU_SA_reg to the ILVU end 
pack address ILVU_EA_reg, i.e., the data for one ILVU, 
is transferred to the stream buffer 2400. The procedure 
then moves to step #31 093. It is thus possible to output 
the data for one contiguous interleaved unit ILVU on the 
recording medium M to the stream buffer 2400. 
[0582] At step #31 093 the ANGLE_NO_reg is updat- 
ed, and the procedure moves to step #31094. This up- 
date operation resets the ANGLE„NO_reg to the angle 
number of the angle selected by the user when the user 
changes the angle using the scenario selector 2100 
(Fig. 26). 

[0583] At step #31 094 it is determined whetherthe an- 
gle cell data has all been transferred. If all ILVU in the 
cell have not been transferred, the procedure moves to 
step #31 095. If all ILVU in the cell have been transferred, 
the process terminates. 

[0584] At step #31095 the reading head 2006 is 
jumped to the next angle (SML_AGL_C#n_reg), and the 
process loops back to step #31091. Note that SML_ 
AGL_C#n_reg is the address of the angle to which the 
ANGLE_NO_reg was updated in step #31093. 
[0585] It is thus possible to transfer the data for the 
angle selected by the user to the stream buffer 2400 in 
ILVU units. 

[0586] The non-seamless multi-angle decoding proc- 
ess is executed in step #30148, Fig. 62, is described 
below referring to Fig. 68. 

[0587] At step #31100 the reading head 2006 is 
jumped to the first cell VOBU start address C_ 
FVOBU^SA read from the C_FVOBU_SA_reg register, 
and the first VOBU data in the cell is transferred to the 



stream buffer 2400. The procedure then moves to step 
#31101. This jump is also accomplished as described 
above, and the loop from step #31101 to step #31 1 06 is 
initiated. 

5 [0588] At step #31101 the DSI packet data in the nav- 
igation pack NV (Fig. 20) is extracted in the stream buff- 
er 2400, the decoding table is defined, and the proce- 
dure moves to step #31 1 02. The registers set in the de- 
coding table are the VOBU_EA_reg, NSML_AGL_ 

10 C1_DSTA_reg, NSM L_AGL_C9_DSTA_reg , VOB_V_ 
SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_PTM1_ 
reg, VOB_A_STP_PTM2_reg, VOB_A_GAP_LEN1_ 
reg, and VOB_A_GAPJ_EN2_reg. 
[0589] At step #31 1 02 the data from the first cell VO- 

15 BU start address C_FVOBU_SA_reg to the end pack 
address VOBU_EA_reg, i.e., the data for one VOBU, is 
transferred to the stream buffer 2400. The procedure 
then moves to step #31 1 03. It is thus possible to output 
the data for one contiguous video object unit VOBU on 

20 the recording medium M to the stream buffer 2400. 
[0590] At step #31 1 03 the ANGLE_NO_reg is updat- 
ed, and the procedure moves to step #31104. This up- 
dace operation resets the ANGLE_NO_reg to the angle 
number of the angle selected by the user when the user 

25 changes the angle using the scenario selector 2100 
(Fig. 26). 

[0591] At step #311 04 it is determined whetherthe an- 
gle cell data has all been transferred. If all VOBU in the 
cell have not been transferred, the procedure moves to 
3Q step #31105. If all VOBU in the cell have been trans- 
ferred, the process terminates. 

[0592] At step #31105 the reading head 2006 is 
jumped to the next angle (NSML_AGL_C#n_reg), and 
the process advances to step #31106. Note that 
35 NSML_AGL_C#n_reg is the address of the angle to 
which the ANGLE_NO_reg was updated in step #31 1 03. 
[0593] It is thus possible to transfer the data for the 
angle selected by the user to the stream buffer 2400 in 
VOBU units. 

40 [0594] Step #31 1 06 is an effective step for high speed 
angle switching, and simply clears the stream buffer 
2400. By thus clearing the stream buffer 2400 the data 
for the newly selected angle can be reproduced without 
reproducing the angle data that is still not decoded. In 

45 other words, clearing the stream buffer 2400 enables 
faster response to user operations. 
[0595] It is very important that DVD decoder accord- 
ing to the present invention can promptly moves to the 
next data reading process and effectively performs the 

so data reading once after the detection of the end of data 
such as interleave unit ILVU and video object unit VOBU 
for the sake of seamless reproduction which is one of 
main targets of the present invention. 
[0596] With reference to Fig. 69, a construction of the 

55 stream buffer 2400 which can performs the end detec- 
tion of interleave unit ILVU is described briefly. The 
stream buffer 2400 comprises a VOB buffer 2402, a sys- 
tem buffer 2404, a navigation pack extractor 2406, and 
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a data counter 2408. The system buffer 2404 temporar- 
ily stores the title control data VTSI(Fig. 16) included in 
signal St61, and outputs a control information St2450 
(St63) such as VTS_PGC. 

[0597] The VOB buffer 2402 temporarily stores the ti- 
tle VOB data VTSTT_VOB (Fig. 16), and the stream 
St67 to the system decoder 2500. 
[0598] The NV (navigation pack) extractor 2406 re- 
ceives the VOB data at the same time with the VOB buff- 
er 2402, and extracts the navigation pack NV therefrom. 
The NV extractor 2406 furthermore extracts the VOBU 
final pack address COBU_EA or ILVU final pack address 
ILVU_EA which are the DSI generation information 
DSLGI shown in Fig. 1 9 to produce a pack address in- 
formation St2452 (St63). 

[0599] The data counter 2408 receives the VOB data 
at the same time with the VOB buffer 2402, and counts 
each of pack data shown in Fig. 19 byte by byte. Then, 
the data counter 2408 produces a pack inputterminating 
signal St2454 (St63) at the time when the inputting of 
pack data is completed. 

[0600] Due to its construction shown in Fig. 69, the 
stream buffer 2400 performs the VOBU data transfer as 
examples at step #31064 of Fig. 64, as follows. The 
stream buffer 2400 outputs the VOBU data for the NV 
extractor 2406 and data counter 2408 at the same time 
when the VOBU buffer 2400 receives the VOBI data on 
the top of interleave unit ILVU. As a result, the NV ex- 
tractor 2406 can extracts the data of ILVU_EA and 
NTJLVU_SA at the same time with the inputting of nav- 
igation pack data NV, and outputs thereof as signal 
St2452 (St63) to the decode system controller 2300 
(Fig. 26) 

[0601] The decode system controller 2300 stores the 
signal St2452 into the ILVU_EA_reg and NT_ 
ILVU_SA__reg, and then start to counts the number of 
packs based on the pack terminating signal 2452 from 
the data counter 2408. Based on the fore mentioned the 
counted value of packs and lLVU_EA_reg, the decode 
system controller 2300 detects the instance when the 
inputting of final pack data of ILVU is completed, or the 
inputting final byte data of the final pack of the ILVU is 
completed. Then, the controller 2300 further give a com- 
mand for the bitstream reproducer 2000 to move to the 
position having a sector address indicated by 
NT_ILVU_SA_reg. The bitstream producer 2000 moves 
to the sector address indicated NT_ILVU_SA_reg , and 
starts to read the data. Thus, the detection of final end 
of ILVU and reading process for the next ILVU can be 
performed effectively. 

[0602] In the above, an example where the multime- 
dia data MBS is reproduced by the bitstream reproducer 
2000 without a buffering process, and is inputted to the 
stream buffer 2499. However, in the case that the signal 
processor 2008 of the bitstream reproducer 2000 is in- 
corporated with a buffer for error correction process, for 
example, the controller 2300 gives a moving command 
to reproducer 2000 so that the reproducer 2000 moves 



to the reading position indicated by NT_ILVU_SA_reg 
after completion of the final pack data of fore mentioned 
ILVU and clearing the internal buffer of the reproducer 
2000. Thus, the effective reproduction of ILVU data even 

5 when the bitstream reproducer 2000 includes a buffer 
for error correction code (ECC) process. 
[0603] Furthermore, when the bitstream producer 
2000 has a buffer for ECC process, the data can be 
transferred effectively by providing any means having a 

10 function equivalent to that of data counter 2408 (Fig. 69). 
In other words, the bitstream reproducer 2000 gener- 
ates the pack input completion signal St62; the decode 
system controller 2300 gives a command based on the 
signal St62 the bitstream reproducer 200 to move to the 

15 reading position having sector address designated by 
NTJLVU_SA_reg. As apparent from the above, the da- 
ta can be transferred effectively even when the bit- 
stream reproducer 2000 has afunction to bufferthe data 
reproduced from the recording media M. 

20 [0604] It is to be noted that the apparatus and method 
substantially the same as those described in the above 
with respect to the interleave unit ILVU can be used for 
the detection VOBU end. In other words, by replacing 
the extraction of ILVU_EA and NT_ILVU_Sa, and the 

25 storing of ILVU_EA_reg and NT_ILVU_SA_reg with the 
extraction of VOBU_EA and storing VOBU_EA_reg, the 
apparatus and method according to the present inven- 
tion, described above, can be used for the detection of 
unatend.This is effective for the VOBU data transferring 

30 operations shown at steps #31074, #31082, #31092, 
and #31102. 

[0605] Thus, the reading of data such as ILVU and 
VOBU can be performed effectively. 

35 Decoding flows in the stream buffer 

[0606] The process for decoding data in the stream 
buffer 2400 shown as step #31034 in Fig. 57 is de- 
scribed below referring to Fig. 58. This process (step 
40 #31 034) comprises steps #31 1 1 0, #31 1 1 2, #31 1 1 4, and 
#31116. 

[0607] At step #31 110 data is transferred in pack units 
from the stream buffer 2400 to the system decoder 2500 
(Fig. 26). The procedure then moves to step #31112. 

45 [0608] At step #31 1 1 2 the pack data is from the stream 
buffer 2400 to each of the buffers, i.e., the video buffer 
2600, sub-picture buffer 2700, and audio buffer 2800. 
[0609] At step #31 1 1 2 the Ids of the user-selected au- 
dio and sub-picture data, i.e., the audio ID 

so AUDIOJD_reg and the sub-picture ID SPJD_reg 
stored to the scenario information register shown in Fig. 
54, are compared with the stream ID and sub-stream ID 
read from the packet header (Fig. 1 9), and the matching 
packets are output to the respective buffers. The proce- 

55 dure then moves to step #31114. 

[061 0] The decode timing of the respective decoders 
(video, sub-picture, and audio decoders) is controlled in 
step #31114, i.e., the decoding operations of thedecod- 
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ers are synchronized, and the procedure moves to step 
#31116. 

[061 1 ] The respective elementary strings are then de- 
coded at step #31116. The video decoder 3801 thus 
reads and decodes the data from the video buffer, the 
sub-picture decoder 31 00 reads and decodes the data 
from the sub-picture buffer, and the audio decoder 3200 
reads and decodes the data from the audio buffer. 
[061 2] This stream buffer data decoding process then 
terminates when these decoding processes are com- 
pleted. 

[0613] The decoder synchronization process of step 
#31114, Fig. 58, is described below with reference to 
Fig. 59. This processes comprises steps #31120, 
#31122, and #31124. 

[0614] At step #31120 it is determined whether a 
seamless connection is specified between the current 
cell and the preceding cell. If a seamless connection, 
the procedure moves to step #31122, if not, the proce- 
dure moves to step #31124. 

[061 5] A process synchronizing operation for produc- 
ing seamless connections is executed in step #31122, 
and a process synchronizing operation for non-seam- 
less connections is executed in step #31124. 
[061 6] To achieve seamless multi-scene reproduction 
control, it is necessary to seamlessly reproduce the con- 
nections between VOB. Except when a VOB, which is 
normally a single-stream title editing unit, is divided into 
discrete streams, however, there is no contiguity be- 
tween the SCR and PTS at the connection. The prob- 
lems relating to reproducing VOB of which the SCR and 
PTS are not contiguous are described below. 
[061 7] Note that the presentation time stamp PTS de- 
claring the video presentation start time is referenced 
below as the VPTS, the decoding time stamp DTS de- 
claring the video decode start time is referenced as 
VDTS, and the presentation time stamp PTS declaring 
the audio reproduction, or presentation, start time is ref- 
erenced as APTS below. 

[0618] The relationship between the SCR, APTS, and 
VPTS values and recording positions in the VOB are 
shown in Fig. 47 and described below. For simplification 
this description is limited to the SCR and PTS parame- 
ters. The top SCR value is recorded with the PTS to both 
the middle audio stream and bottom video stream. If the 
positions on the horizontal axis are approximately the 
same, approximately the same SCR value is recorded 
to each stream. 

[061 9] Tse is the time indicated by the SCR of the last 
pack in the VOB; Tve is the time indicated by the VPTS 
of the last video pack in the VOB; Tae is the time indi- 
cated by the APTS of the last audio pack in the VOB; 
Tvd is the video decode buffer delay time; and Tad is 
the audio decode buffer delay time. 
[0620] Fig. 48 is a time line from input of the VOB 
shown in Fig. 47 to the system decoder to output of the 
last audio and video reproduction data. The horizontal 
axis shows the passage of time t, and the vertical axis 



shows the SCR value, which indicates the time transfer 
should be accomplished, and the PTS values, which in- 
dicate the time reproduction should be accomplished. 
[0621] Both audio and video outputs are thus delayed 

5 by the respective decode buffers referenced to the sys- 
tem clock reference SCR, and while the audio and video 
data are input substantially simultaneously, the video 
data is reproduced at a slight delay after the audio data. 
This delay is caused by the difference in the video and 

10 audio decode buffer delay times. 

[0622] In addition, when two VOB are connected, 
there is no contiguity between the SCR and PTS at the 
connection except when a single-stream VOB is split in- 
to separate streams. 

15 [0623] Contiguous reproduction of VOB #1 and VOB 
#2 having non-contiguous SCR and PTS parameters is 
described below referring to Fig. 46. 
[0624] Fig. 46 also shows the relationship between 
the SCR, APTS, and VPTS values and recording posi- 

20 tions in each VOB. 

[0625] The system clock reference SCR is time infor- 
mation indicating the pack transfer time, and is recorded 
with each pack; APTS is the audic playback starting time 
information recorded with each audio packet; and VPTS 

25 j s the video playback starting time information recorded 
with each video packet. The system clock STC is a ref- 
erence clock for decoder synchronization control. 
[0626] Tse1 is the time indicated by the SCR of the 
last pack in VOB #1 ; Tae1 is the time indicated by the 

30 last APTS in VOB #1 ; and Tve1 is the time indicated by 
the last VPTS in VOB #1. 

[0627] Tad is the audio decode buffer delay time; Tvd 
is the video decode buffer delay time; and the horizontal 
axis indicates the passage of time t. 

35 [0628] What is important to note here is that synchro- 
nizing the audio and video so that the reproduced audio 
and video signals are output at the time the system clock 
STC equals the APTS and VPTS values in the stream. 
[0629] To maintain the reference clock for transferring 

40 VOB to the system decoder, the first system clock ref- 
erence SCR value in VOB #2 must be set to the STC 
initializer at precisely time Tse1. However, because re- 
production and output of VOB #1 have not been com- 
pleted at this time point, the audio and video still to be 

45 output from VOB #1 after time Tse1 cannot be normally 
reproduced because the reference clock is lost. 
[0630] Furthermore, even if the SCR value of the STC 
initializer is set at the audio end time Tae1 , i.e. , later than 
the system clock reference SCR time Tse1 , the refer- 

50 ence clock at which the first pack in VOB #2 should be 
transferred is lost, and the reference clock for the VOB 
#1 video output to be reproduced after time Tae1 is lost. 
[0631] The same problem also occurs if the system 
clock reference SCR is set to the STC initializer at time 

55 Tve1. 

[0632] When there is a one-to-one correspondence 
between the VOB reproduced first, i.e., the first VOB, 
and the VOB reproduced thereafter, i.e., the second 
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VOB, this problem can be avoided by assuring that the 
first SCR value of the second VOB is contiguous to the 
last SCR of the first VOB. However, when common data 
is shared between plural titles, there is a many-to-one 
relationship between the first VOB and the many VOB 
that may be reproduced thereafter. When contiguously 
reproducing a second VOB #2 following a first VOB #1 , 
it is therefore necessary to destroy any remaining VOB 
#1 data in the decode buffer at time Tse1 , and seamless 
reproduction in which the audio and video are not inter- 
mitted is not possible. 

[0633] A method(s) for seamlessly reproducing VOB 
of which the SCR and PTS are not contiguous is de- 
scribed by means of the two embodiments of the inven- 
tion below. 

(Synchronization controller: embodiment 1) 

[0634] A first embodiment of the synchronizer 2900 
shown in Fig. 26 is described according to the present 
invention with reference to Fig. 32 below. As shown m 
Fig. 32, the synchronizer 2900 comprises a system 
clock STC generator 2902, PTS/DTS extractor 2904, 
video decoder synchronizer 2906, sub-picture decoder 
synchronizer 2908, audio decoder synchronizer 2910, 
and system decoder synchronizer 2912. 
[0635] The STC generator 2902 generates the sys- 
tem clock for each decoder, and supplies the synchro- 
nization system clock STC to the video decoder syn- 
chronizer 2906, sub-picture decoder synchronizer 
2908, audio decoder synchronizer 2910, and system 
decoder synchronizer 2912. The STC generator 2902 
is described in detail below with reference to Fig. 39. 
[0636] The PTS/DTS extractor 2904 extracts the 
presentation time stamp PTS and decoding time stamp 
DTS from the synchronization control data St81 , and 
supplies the PTS and DTS to the decoder synchroniz- 
ers. 

[0637] The video decoder synchronizer 2906 gener- 
ates the video stream decoding start signal St89 based 
on the STC from the STC generator 2902 and the de- 
coding time stamp DTS for starting video decoding sup- 
plied from the PTS/DTS extractor 2904. More specifi- 
cally, the video decoder synchronizer 2906 generates 
the video stream decoding start signal St89 when the 
STC and DTS match. 

[0638] The sub-picture decoder synchronizer 2908 
generates the sub-picture stream decoding start signal 
St91 based on the STC from the STC generator 2902 
and the decoding time stamp DTS for starting sub-pic- 
ture decoding supplied from the PTS/DTS extractor 
2904. More specifically, the sub-picture decoder syn- 
chronizer 2908 generates the sub-piccure stream de- 
coding start signal St91 when the STC and DTS match. 
[0639] The audio decoder synchronizer 2910 gener- 
ates the audio stream decoding start signal St93 based 
on the STC from the STC generator 2902 and the de- 
coding time stamp DTS for starting audio decoding sup- 



plied from the PTS/DTS extractor 2904. More specifi- 
cally, the audio decoder synchronizer 2910 generates 
the audio stream decoding start signal St93 when the 
STC and DTS match. 

5 [0640] The system decoder synchronizer 2912 out- 
puts the STC from the STC generator 2902 as the sys- 
tem clock St79. The system clock St79 is used to control 
pack transfers from the stream buffers to the system de- 
coder. Inotherwords, if the STC value exceeds the SCR 

10 value in the pack, the pack data is transferred from the 
stream buffer to the system decoder. 
[0641] The structure and operation of the STC gener- 
ator 2902 are described in detail below with reference 
to Fig. 39. As shown in Fig. 39, the STC generator 2902 

15 comprises a system clock STC initializer 32010, STC 
offset calculator 32012, STC counter 32014, STC re- 
generator 32016, STC selection controller 32018, STC 
selector for video decoder 32020, STC selector for sub- 
picture decoder 32022, STC selector for audio decoder 

20 32024, and STC selector for system decoder 32026. 
[0642] The STC offset calculator 3201 2 calculates the 
offset value STCof used to update the system clock STC 
for contiguous reproduction of two VOBs with different 
initial system clock STC values (SCR). 

25 [0643] More specifically, the offset value STCof is cal- 
culated by subtracting the Initial Video Frame Presen- 
tation Start Time VOBJ/_SPTM_reg value of the next 
VOB to be reproduced from the Final Video Frame Pres- 
entation Termination Time VOB_V_EPTM_reg (see Fig. 

30 39) of the VOB reproduced first. 

[0644] The STC counter 3201 4 is a sequential coun- 
ter that counts from a set value synchronized to the sys- 
tem clock, and generates the reference clock STCc for 
each decoder. 

35 [0645] The STC regenerator 32016 outputs a reset 
system clock STCr by subtracting the offset value STCof 
calculated by the STC offset calculator 32012 from the 
reference clock STCc supplied from the STC counter 
32014. 

40 [0646] The system clock STC initializer 320 10 selects 
and sets the SCR from the first pack in the VOB, or the 
reset system clock STCr output from the STC regener- 
ator 3201 6, according to the control signal from the STC 
selection controller 3201 8. The value set by the STC in- 

45 itializer 3201 0 is the initial value used by the STC coun- 
ter 32014. 

[0647] The STC selector for video decoder 32020 se- 
lects either the output STCc from the STC counter 
32014 or the output STCr from the STC regenerator 

so 3201 6 according to the control signal from the STC se- 
lection controller 3201 8, and outputs the selected signal 
to the video decoder synchronizer 2906. 
[0648] The STC selector for sub-picture decoder 
32022 similarly selects either output STCc or output ST- 

55 Cr according to the control signal from the STC selection 
controller 3201 8, and outputs the selected signal to the 
sub-picture decoder synchronizer 2908. 
[0649] The STC selector for audio decoder 32024 
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similarly selects either output STCc or output STCr ac- 
cording to the control signal from the STC selection con- 
troller 3201 8, and outputs the selected signal to the au- 
dio decoder synchronizer 291 0. 
[0650] The STC selector for system decoder 32026 
similarly selects either output STCc or output STCr ac- 
cording to the control signal from the STC selection con- 
troller 3201 8, and outputs the selected signal to the sys- 
tem decoder synchronizer 2912. 
[0651] The operation of the STC selection controller 
32018 during non-seamless reproduction is described 
below with reference to Fig. 60. During non-seamless 
operation (when the SPF_reg flag _ SML), all STC se- 
lectors, i.e., STC selectorforvideo decoder 32020, STC 
selectorfor sub-picture decoder 32022, STC selectorfor 
audio decoder 32024, and STC selector for system de- 
coder 32026, select and output the reference clock 
STCc. More specifically, the STC selectors synchronize 
decoder operation based on the reference clock STCc 
output by the STC counter 3201 4. 
[0652] The operation of the STC selection controller 
32018 during seamless reproduction (when the 
SPF_reg flag = SML) is also described below with ref- 
erence to Fig. 40 and Fig. 61 . 

[0653] Fig. 40 also shows the relationship between 
the SCR, APTS, VDTS, and VPTS values and recording 
positions in each stream when two VOBs #1 and #2 are 
connected and seamless reproduced. 
[0654] The system clock reference SCR is time infor- 
mation indicating the pack transfer time, and is recorded 
with each pack; APTS is the audio pi ay back starting time 
information recorded with each audio packet; VDTS is 
the video decode start time information recorded with 
each video packet; and VPTS is the video playback 
starting time information recorded with each video pack- 
et. The system clock STC is a reference clock for de- 
coder synchronization control. 

[0655] Tse1 (T1) is the time indicated by the SCR of 
the last pack in VOB #1 ; Tae1 (T2) is the time indicated 
by the last APTS in VOB #1 ; Tde1 (T3) is the time indi- 
cated by the last VDTS in VOB #1 ; and Tve1 (T4) is the 
time indicated by the last VPTS in VOB #1 .i.e., the Final 
Video Frame Presentation Termination Time 
VOB_V_EPTM identified by the last VPTS in VOB #1 . 
[0656] Tad is the audio decode buffer delay time; and 
Tvd is the video decode buffer delay time. 
[0657] Tad is the audio decode buffer delay time; Tdd 
is the video decode buffer delay time; and Tve is the 
sum of the video decode buffer delay time and the delay 
until video presentation. 

[0658] Fig. 61 is a flow chart used to describe the op- 
eration of the STC selection controller 32018 shown in 
Fig. 39 during seamless reproduction operation. 
[0659] At step #311220, the STC offset (offset value 
STCof) is calculated, and the procedure moves to step 
#311221. 

[0660] As described above, the STC offset value 
STCof is calculated by subtracting the Initial Video 



Frame Presentation Start Time VOB_V_SPTM_reg val- 
ue of the next VOB to be reproduced from the Final Vid- 
eo Frame Presentation Termination Time VOB_V_ 
EPTM_reg (see Fig. 55) of the VOB reproduced first. 
5 Thus, the total reproduction time of the VOB reproduced 
first is calculated as the offset value STCof of the VOB 
reproduced next. 

[0661 ] At step #31 1 221 the calculated STC offset val- 
ue STCof is applied to the STC regenerator 32016 to 
10 update the system clock STC. The procedure then 
moves to step #311222. 

[0662] The STC regenerator 3201 6 updates the sys- 
tem clock STC by subtracting the output STCof from the 
STC offset calculator 32012 from the reference clock 
15 STCc output from the STC counter 32014 (STCc - 
STCof), and outputs the result as the reset system clock 
STCr. 

[0663] At step #311 222 the reset system clock STCr 
is selectively output by the STC selection controller 

20 3201 8 to the STC selector for system decoder 32026 at 
time T1 (Fig. 40), i.e., at the time SCR changes from 
stream VOB #1 to VOB #2. The procedure then moves 
to step #311 223. Thereafter the reset system clock ST- 
Cr is applied as the system clock STC referenced by the 

25 system decoder, and the transfer timing of VOB #2 to 
the system decoder is determined by the SCR in the 
pack header of the pack and the STCr. 
[0664] At step #31 1 223 the reset system clock STCr 
is selectively output to the STC selector for audio de- 

30 coder 32024 at time T2 (Fig. 40), i.e., at the time APTS 
changes from stream VOB #1 to VOB #2. The procedure 
then moves to step #31 1224. Thereafter the reset sys- 
tem clock STCr is applied as the system clock STC ref- 
erenced by the audio decoder, and the VOB #2 audio 

35 output timing is determined by the APTS in the audio 
packet and the STCr. In other words, when the STCr 
matches the APTS, the audio data corresponding to that 
APTS is reproduced. 

[0665] At step #31 1224 the reset system clock STCr 

40 is selectively output to the STC selector for video de- 
coder 32020 at time T3 (Fig. 40), Let, at the time VDTS 
changes from stream VOB #1 to VOB #2. The procedure 
then moves to step #311225. Thereafter the reset sys- 
tem clock STCr is applied as the system clock STC ref- 

45 erenced by the video decoder, and the VOB #2 video 
decode timing is determined by the VDTS in the video 
packet and the STCr. In other words, when the STCr 
matches the VDTS, the video data corresponding to that 
VDTS is decoded. 

so [0666] At step #311 225 the reset system clock STCr 
is selectively output to the STC selector for sub-picture 
decoder 32022 at time T4 (Fig. 40), i.e., at the time 
VPTS changes from stream VOB #1 to VOB #2. The 
procedure then moves to step #311226. Thereafter the 

55 reset system clock STCr is applied as the system clock 
STC referenced by the sub-picture decoder, and the 
VOB #2 sub-picture presentation timing is determined 
by the presentation time stamp PTS in the sub-picture 
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packet and the STCr. 

[0667] In other words, when the STCr matches the 
PTS, the sub-picture data corresponding to that PTS is 
reproduced. Note that the process from sub-picture de- 
coding to presentation is accomplished instantaneously. 
As a result, the system clock STC value referenced by 
the sub-picture decoder changes at the same timing at 
which the video playback starting time information 
VPTS changes from VOB #1 to VOB #2. 
[0668] At step #311 226 the STCr is reset to the STC 
initializer32010. The STC initializer 32010 thus supplies 
this updated clock to the STC counter 3201 4, which op- 
erates using this reset system clock STCr value as the 
initial value. The procedure then moves to step 
#311227. 

[0669] At step #311227 all of the STC selectors, i.e., 
STC selector for video decoder 32020, STC selector for 
sub-picture decoder 32022, STC selector for audio de- 
coder 32024, and STC selector for system decoder 
32026, output the reference clock STCc. 
[0670] Thereafter, the system clock STC referenced 
by the video decoder, sub-picture decoder, audio decod- 
er, and system decoder is the reference clock STCc out- 
put from the STC counter 32014. 
[0671] The process from step #311226 to step 
#31 1 227 must only be accomplished by the time the sys- 
tem clock reference SCR changes from the VOB #2 
SCR to the first SCR in the VCB following VOB #2, i.e., 
by time T1 at which the change to the next VOB is ac- 
complished. 

[0672] Note that the STC switching time T1 can also 
be obtained by detecting the change in the Initial Video 
Frame Presentation Start Time VOB_V_SPTM or the Fi- 
nal Video Frame Presentation Termination Time 
VOB_V_EPTM in the navigation pack NV, and extract- 
ing the SCR in the pack immediately before the change. 
Note that the same VOB_V_SPTM and VOB_V_EPTM 
values are recorded to all navigation packs NV in the 
same VOB. As a result, when either of these values 
changes, the VOB has also changed, and by monitoring 
a change in either of these values, it is possible to know 
that the VOB has changed. 

[0673] T1 can be obtained by adding the pack transfer 
time to the SCR value in the pack immediately before 
the VOB changed. Note that the pack transfer time is a 
constant value. 

[0674] STC switching times T2 and T3 can also be 
calculated from the APTS, VDTS, and VPTS values ex- 
tracted immediately before the VOB_V_SPTM or 
VOB_V_EPTM value in the navigation pack NV chang- 
es. 

[0675] T2 is calculated by extracting the APTS from 
the audio packet immediately before the VOB changes, 
and adding the audio reproduction time contained in the 
audio packet of the corresponding APTS value. Note 
that the audio reproduction time contained in the audio 
packet can be calculated from the audio bit rate and the 
packet data size. 



[0676] T3 can be obtained by extracting the VDTS 
from the video packet containing the corresponding 
VDTS immediately before the VOB changes. T3 is thus 
obtained as the time defined by the VDTS. 
5 [0677] T4 is equivalent to the VCB_V_EPTM value, 
which is therefore used. 

(Synchronization controller: embodiment 2\ 

w [0678] A second embodiment of the synchronizer 
2900 shown in Fig. 26 is described according to the 
present invention with reference to Fig. 41 below. As 
shown in Fig. 41 , the synchronizer 2900 comprises a 
system clock STC generator 32030, PTS/DTS extractor 

is 32031 , synchronization controller 32032, video decoder 
synchronizer 32033, sub-picture decoder synchronizer 
32034, audio decoder synchronizer 32035, and system 
decoder sy nchron izer 32036 . Th e STC generator 32030 
generates the system clock for each decoder, and sup- 

20 plies the synchronization system clock STC to the video 
decoder synchronizer 32033, sub-picture decoder syn- 
chronizer 32034, audio decoder synchronizer 32035, 
and system decodersynchronizer32036. The STC gen- 
erator 32030 is a counter that operates at the system 

25 dock. The SCR from the first pack in the first VOB in the 
program chain PGC is set as the initial counter value, 
and is thereafter incremented based on the system 
clock. Note that the initial STC counter value may be 
reset to the APTS or VPTS value. 

30 [0679] Both the audio and video outputs are repro- 
duced synchronized to the respective output clocks. It 
is therefore possible for synchronization to be disrupted 
by the accumulated error in STC, audio output clock, 
and video output clock precision. When this accumulat- 

35 ed error becomes great, the respective decoder buffers 
may be destroyed (by a data underflow or overflow 
state). Therefore, by periodically resetting the system 
clock STC to the APTS synchronized to the audio output 
clock, for example, APTS - STC error accumulation can 

40 be prevented, and the audio can be reproduced without 
interruption. In this case, video synchronization is con- 
trolled by skipping or freezing video output. 
[0680] This type of synchronization control is defined 
as Audio Master synchronization control. 

45 [0681 ] On the other hand, by periodically resetting the 
system clock STC to the VPTS synchronized to the vid- 
eo output clock, VPTS - STC error accumulation can be 
prevented, and the video can be seamlessly repro- 
duced. In this case, audio synchronization is controlled 

50 by skipping or pausing audio output. 

[0682] This type of synchronization control is defined 
as Video Master synchronization control. 
[0683] In the following description of synchronization 
control, an ON synchronization mode refers to STC-ref- 

55 erenced synchronization control, either Audio or Video 
Master. An OFF synchronization mode is when STC-ref- 
erenced synchronization control is not applied. In an 
OFF synchronization mode the audio and video decod- 
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ers sequentially output the audio and video at a defined 
frame frequency based on the respective internal refer- 
ence clocks without referencing the time stamp values 
from the streams, and no timing control is applied to syn- 
chronize the audio and video. 

[0684] The PTS/DTS extractor 32031 extracts the 
presentation time stamp PTS and decoding time stamp 
DTS from the synchronization control data St81 , and 
supplies the PTS and DTS to the respective decoder 
synchronizers. 

[0685] The synchronization controller 32032 gener- 
ates the synchronization control signal specifying an ON 
or OFF synchronization mode for each of the decoder 
synchronizers. This synchronization controller 32032 is 
described in detail below with reference to Fig. 42. 
[0686] If the synchronization control signal from the 
synchronization controller 32032 specifies an ON syn- 
chronization mode, the video decoder synchronizer 
32033 generates the video stream decoding start signal 
St89 based on the STC from the STC generator 32030 
and the decoding time stamp DTS for starting video de- 
coding supplied from the PTS/DTS extractor 32031 . 
More specifically, the video decoder synchronizer 
32033 generates the video stream decoding start signal 
St89 when the STC and DTS match. 
[0687] If the synchronization control signal from the 
synchronization controller 32032 specifies an OFF syn- 
chronization mode, the video decoder synchronizer 
32033 constantly outputs the video stream decoding 
start signal St89. Thus, the video decoder operates in- 
dependently of external control, and is controlled by in- 
ternal state information. 

[0688] If the synchronization control signal from the 
synchronization controller 32032 specifies an ON syn- 
chronization mode, the sub-picture decoder synchroniz- 
er 32034 generates the sub-picture stream decoding 
start signal St91 based on the STC from the STC gen- 
erator 32030 and the decoding time stamp DTS for start- 
ing sub-picture decoding supplied from the PTS/DTS 
extractor 32031 . More specifically, the sub-picture de- 
coder synchronizer 32034 generates the sub-picture 
stream decoding start signal St91 when the STC and 
DTS match. 

[0689] If the synchronization control signal from the 
synchronization controller 32032 specifies an OFF syn- 
chronization mode, the sub-picture decoder synchroniz- 
er 32034 constantly outputs the sub-picture stream de- 
coding start signal St91 . Thus, the sub-picture decoder 
operates independently of external control, and is con- 
trolled by internal state information. 
[0690] If the synchronization control signal from the 
synchronization controller 32032 specifies an ON syn- 
chronization mode, the audio decoder synchronizer 
32035 generates the audio stream decoding start signal 
St93 based on the STC from the STC generator 32030 
and the decoding time stamp DTS for starting audio de- 
coding supplied from the PTS/DTS extractor 32031. 
More specifically, the audio decoder synchronizer 



32035 generates the audio stream decoding start signal 
St93 when the STC and DTS match. 
[0691] If the synchronization control signal from the 
synchronization controller 32032 specifies an OFF syn- 

5 chronization mode, the audio decoder synchronizer 
32035 constantly outputs the audio stream decoding 
start signal St93. Thus, the audio decoder operates in- 
dependently of external control, and is controlled by in- 
ternal state information. 

10 [0692] The system decoder synchronizer 32036 out- 
puts the STC from the STC generator 32030 as the sys- 
tem clock St79. The system clock St79 is used to control 
pack transfers from the stream buffers to the system de- 
coder. In other words, if the STC value matches the SCR 

is value in the pack, the pack data is transferred from the 
stream buffer to the system decoder. 
[0693] The structure and operation of the synchroni- 
zation controller 32032 are described below with refer- 
ence to Fig. 42 and Fig. 43. 

20 [0694] The structure of the synchronization controller 
32032 is shown in Fig. 42. As shown in the figure, the 
synchronization controller 32032 comprises an SCR 
change detector 32040, APTS changing time detector 
32041 , VPTS changing time detector 32042, and syn- 

25 chronization mode selector 32043. 

[0695] The SCR change detector 32040 generates 
and supplies to the synchronization mode selector 
32043 an ACTIVE SCR change detection signal if the 
SCR value in the pack header in the synchronization 

30 control data St81 changes to 0. By thus setting the SCR 
to 0 in the first pack of the VOB reproduced later when 
seamlessly connecting and reproducing two VOB, the 
VOB break point can be easily detected. Note that the 
SCR is not set to 0 when an originally contiguous VOB 

35 is divided in two, i.e., when the SCR is contiguous be- 
tween two VOB. 

[0696] Note, further, that while a 0 value is described 
here, any value whereby the beginning and end of each 
VOB can be easily determined can be used. 

40 [0697] In the case of parental lock control, for exam- 
ple, when a single stream, e.g., VOB #2, is reconnected 
to one of plural possible scenes, e.g., VOB #1 , from the 
multi-scene period enabling parental lock control, each 
of the VOB in the multi-scene period may have a differ- 

45 ent reproduction time, and it is not possible to assign the 
first SCR value in each possible VOB #2 to account for 
all possible connections. A seamless can be achieved 
in such cases, however, by setting the SCR of the first 
pack in VOB #2 to 0. 

50 [0698] The APTS changing time detector 32041 com- 
pares the APTS in the synchronization control data St81 
when the VOB changes with the STC counter value sup- 
plied from the STC generator 32030 shown in Fig. 41 . 
When the STC counter exceeds the APTS, the APTS 

55 changing time detector 32041 generates and inputs to 
the synchronization mode selector 32043 an ACTIVE 
APTS change time detection signal. 
[0699] Note that the method of detecting the APTS 
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when the VOB changes is described below with refer- 
ence to Fig. 43. 

[0700] The VPTS changing time detector 32042 com- 
pares the VPTS in the synchronization control data St81 
when the VOB changes with the STC counter value sup- 
plied from the STC generator 32030. When the STC 
counter exceeds the VPTS, the VPTS changing time de- 
tector 32042 generates and inputs to the synchroniza- 
tion mode selector 32043 an ACTIVE VPTS change 
time detection signal. 

[0701] Note that the method of detecting the VPTS 
when the VOB changes is described below with refer- 
ence to Fig. 43. 

[0702] Based on the SCR change detection signal 
from the SCR change detector 32040, the APTS change 
time detection signal from the APTS changing time de- 
tector 32041 , and the VPTS change time detection sig- 
nal from the VPTS changing time detector 32042, the 
synchronization mode selector 32043 generates the 
synchronization mode selection signals, and outputs to 
the video decoder synchronizer 32033, sub-picture de- 
coder synchronizer 32034, audio decoder synchronizer 

32035, and system decoder synchronizer 32036. The 
STC update signal STCs is also output to the STC gen- 
erator 32030. 

[0703] The respective decoder synchronizers control 
synchronization based on the system clock STC if an 
ON synchronization mode is specified. If an OFF syn- 
chronization mode is specified, the STC is not used for 
synchronization control as described above. 
[0704] The operation of the synchronization mode se- 
lector 32043 is described next with reference to Fig. 43. 
[0705] At step #320430 the STC update signal STCs 
is generated and output to the STC generator 32030, 
and the procedure moves to step #320431. If the STC 
update signal STCs is ACTIVE, the STC generator 
32030 sets a new SCR from the synchronization control 
data St81 as the initial value, and updates the STC. 
[0706] At step #320431 the synchronization mode se- 
lector 32043 outputs a synchronization mode selection 
signal specifying an ON synchronization mode to the de- 
coder synchronizers 32033, 32034, 32035, and 32036. 
The procedure then moves to step #320432. 
[0707] At step #320432 the procedure moves to step 
#320433 if the SCR change detector 32040 detects that 
the SCR has changed. If an SCR change is not detect- 
ed, this step is repeated until an SCR change is detect- 
ed. As a result, an ON synchronization mode continues 
to be output to the decoder synchronizers as long as 
this step executes. 

[0708] At step #320433 the synchronization mode se- 
lector 32043 outputs a synchronization mode selection 
signal specifying an OFF synchronization mode to the 
decoder synchronizers 32033, 32034, 32035, and 

32036. The procedure then moves to step #320434. 
This step thus means that the synchronization mode is 
OFF from the time T1 the VOB changed during pack 
transfer. 



[0709] If both the APTS changing time detector 32041 
and VPTS changing time detector 32042 detect a 
changed time at step #320434, control loops back to 
step #320430, and the synchronization mode is set ON 

5 again at step #320431 . However, if a changed time is 
not detected, step #320434 repeats until both the APTS 
and VPTS are detected to change. This step thus means 
that the decoder synchronizers continue operating in an 
OFF synchronization mode. 

10 [0710] Synchronization control at the start of normal 
reproduction, i.e., at a VOB start without contiguous re- 
production from a preceding VOB, is described next with 
reference to Fig. 44. 

[0711] Fig. 44 shows the relationship between the 
15 system clock reference SCR indicating the time a VOB 
is input to the system decoder data, the audio playback 
starting time information APTS, the decoder reference 
clock STC, and the video playback starting time VPTS, 
referenced to time shown on the horizontal axis with the 
20 values expressed in terms of the presentation start time 
PST on the vertical axis. 

[0712] The point at which the first SCR in the VOB is 
0 is point A. If the first SCR is not 0, e.g., if normal re- 
production is resumed from che middle of a VOB after 

25 a special reproduction mode, the control procedure is 
the same. The times between input and output of the 
audio data and video data, respectively, to and from the 
system decoder are expressed as DTad and DTvd. Be- 
cause DTad < DTvd, and data is recorded at the VCB 

30 beginning referenced to the time it is reproduced, only 
video data is present at point C at the VOB start, and 
the audio data is recorded at point D after a delay of 
DTvd - DTad. 

[0713] Synchronization is controlled at this point as 
35 described below. Video and audio output is first stopped, 
the SCR value from the pack at point A is set to the STC 
generator 32030, and the STC generator 32030 oper- 
ates an internal counter at the system clock and outputs 
the system clock STC. Transfer of the first pack in the 
40 VOB to the system decoder 2500 starts at the same 
time. The following packs are transferred at the SCR re- 
corded in the pack header of each pack referenced to 
the system clock STC produced by the STC generator 
32030. 

45 [071 4] The first video data is then decoded, and video 
output starts at point F when the STC output from the 
STC generator 32030 matches the first VPTS value. 
[071 5] Audio output is similarly controlled: the first au- 
dio data is decoded, and audio output starts at point E, 

50 the moment when the STC output from the STC gener- 
ator 32030 matches the first APTS value. 
[071 6] After reproduction of the start of the VOB thus 
begins, the audio APTS value is periodically set to the 
STC generator 32030 to control synchronization under 

55 Audio Master or Video Master control. 

[071 7] The method of synchronizing seamless repro- 
duction of two VOB is described next. The detection 
method of the SCR change detector 32040, APTS 



93 



EP 0 920 203 B1 



94 



changing time detector 32041 , and VPTS changing time 
detector 32042 in Fig. 42 is particularly described with 
reference to Fig. 45. 

[071 8] Fig. 45 shows the relationship between the re- 
cording positions and values of SCR, APTS, and VPTS 5 
when VOB #1 and VOB #2 are seamlessly reproduced. 
Why the synchronization mode of the decoder synchro- 
nizers must be switched ON and OFF to achieve seam- 
less reproduction is described first. 
[0719] Point G is the time at which the pack being 10 
transferred changes from VOB #1 to VOB #2, H is the 
time the audio output changes, and I is the time the video 
output changes. Because the audio output and video 
output thus change at different times, synchronization 
control cannot be achieved using a single system clock is 
STC. It is therefore necessary to prevent synchroniza- 
tion control using the STC during the period from when 
the SCR changes at time G to when both the APTS and 
VPTS have changed at time I. After both the APTS and 
VPTS have changed at time I, synchronization control 
using the STC is again possible and necessary. 
[0720] The method of detecting the timing at which 
synchronization control is stopped, i.e., when the syn- 
chronization mode is turned OFF, is described next. 
[0721] The timing at which the synchronization mode 
is turned OFF is obtained from the SCR time chart in 
Fig. 45. VOB #1 is being output to the system decoder 
while the SCR value increases, and SCR becomes 0 
only at time G, i.e., when VOB #1 transfer stops and 
VOB #2 transfer begins. Therefore, by detecting the 
time at which the SCR value becomes 0, it is known that 
VOB #2 is being input to the system decoder, and the 
synchronization mode is therefore set OFF at this time 
Tg- 

[0722] It is also possible to detect that the SCR is 0 
when the value is written to the stream buffer. The syn- 
chronization mode can also be set OFF when a 0 SCR 
value is detected written to the stream buffer. 
[0723] The timing at which synchronization control 
begins, i.e., when the synchronization mode is turned 
from OFF to ON, is described next. 
[0724] To start synchronization control it is necessary 
to know when both the audio and video output have 
switched from VOB #1 to VOB #2. The moment when 
the audio output changes to VOB #2 can be known by 
detecting the point H at which the APTS value stops in- 
creasing. Likewise, the moment when the video output 
changes to VOB #2 can be known by detecting the point 
I at which the VPTS value stops increasing. After both 
points H and I have appeared, the synchronization 
mode is immediately set ON at time 71. 
[0725] The timing at which the synchronization mode 
is set OFF can also be delayed to a time between time 
Tg and time Ti, rather than at time Tg when an SCR 
change is detected. By setting the synchronization 
mode OFF at time Th, i.e., the time at which a change 
in APTS or VPTS is detected, between Tg and Ti, the 
duration of the O FF synchronization mode can be short- 



ened. 

[0726] However, when the timing is based on detect- 
ing whether both APTS and VPTS values continue in- 
creasing, it is clear that both APTS and VPTS values 
must decrease when VOB are connected. In other 
words, the last APTS and VPTS values in a VOB must 
be greater than the maximum initial APTS and VPTS 
values in a VOB. 

[0727] The maximum initial (DTad and DTvd) APTS 
and VPTS values are defined as follows. 
[0728] The initial APTS and VPTS values are the 
sums of the video data and audio data storage times in 
the video and audio buffers, and the video reordering 
time (in MPEG video, the picture decode order and pres- 
entation order are not necessarily the same, and pres- 
entation is delayed a maximum one picture from the de- 
coder). Therefore, the sum of the time required for the 
video buffer and audio buffer to fill, and the presentation 
delay (1 frame period) from video reordering, deter- 
mines the maximum initial values for APTS and VPTS. 
[0729] As a result, the last APTS and VPTS values in 
a VOB are always assigned to exceed these maximum 
initial values when the VOB are produced. 
[0730] While it is possible to control the timing at 
which the synchronization mode is turned ON following 
VOB connection by detecting whether both APTS and 
VPTS values continue increasing, it is also possible to 
achieve the same synchronization control by detecting 
the time at which the APTS value drops below an APTS 
threshold value, and the VPTS value drops below an 
VPTS threshold value. 

[0731] These APTS and VPTS threshold values can 
be calculated by using values equal to the maximum in- 
itial APTS and VPTS values of the VOB as the threshold 
values, and calculating them as described above. 
[0732] By applying ON/OFF synchronization mode 
control as described above, seamless reproduction that 
does not disturb the playback condition can be achieved 
in VOB connections. 

[0733] Note that various methods can be used in the 
reproduction apparatus for audio and video synchroni- 
zation control in the second embodiment described 
above. The most common of these methods (1 ) is to re- 
set the system clock STC to the APTS value every few 
seconds, determine whether the VPTS value is fast or 
slow referenced to the system clock STC, and freeze or 
skip video output as necessary. This is the so-called Au- 
dio Master method described above. 
[0734] Alternatively, it is also possible (2) to reset the 
system clock STC to the VPTS value every few sec- 
onds, determine whether the APTS value is fast or slow 
referenced to the system clock STC, and freeze or skip 
audio output as necessary. This is the so-called Video 
Master method described above. 
[0735] Another method (3) is to directly compare the 
APTS and VPTS values, and use either the APTS or 
VPTS value for reference. 

[0736] Regardless of which method is used for ON/ 
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OFF synchronization mode control, however, the same 
effect can be achieved. 

[0737] As described above, two methods are used for 
audio-video synchronization according to the present 
embodiment, Audio Master and Video Master synchro- 
nization control. For Audio Master synchronization con- 
trol the system clock STC is periodically reset to the 
APTS value, whether the VPTS value is earlier or later 
is determined referenced to the reset system clock STC, 
and the video presentation is frozen or skipped as nec- 
essary for synchronization. For Video Master synchro- 
nization control the system clock STC is periodically re- 
set to the VPTS value, whetherthe APTS value is earlier 
or later is determined referenced to the reset system 
clock STC, and the audio presentation is paused or 
skipped as necessary for synchronization. It is also pos- 
sible to directly compare the APTS and VPTS values, 
and control synchronization referenced to either the 
APTS or VPTS value. Regardless of which method is 
used for AV synchronization control, however, the same 
effects can be obtained. 

£0738] Furthermore, while the above embodiments 
have been described using an initial VOB SCR value of 
0, a value other than 0 can be used and the same control 
achieved by adding the first SCR value as an offset val- 
ue to the APTS and VPTS values. 
[0739] It is also possible in the second embodiment 
above to read the STC discontinuity flag STCDCF_reg, 
which specifies whetherthe next cell reproduced needs 
the STC to be reset. If the register reads STC_N RESET, 
the synchronization mode is constantly ON; if 
STC_RESET is stored, ON/OFF synchronization mode 
control can be applied. 

[0740] It is thus possible to decode the data trans- 
ferred to the stream buffer 2400 while synchronizing the 
operation of the various decoders. 
[0741 ] It is therefore possible by means of the present 
invention thus described to maintain synchronization 
between the audio and video data, and seamlessly con- 
nect and reproduce two VOB during reproduction from 
a multi-scene period even when there is no continuity 
between the system clock reference SCR and presen- 
tation time stamp PTS values used for synchronization 
control of the VOB to be contiguously reproduced. 

Industrial Applicability 

[0742] As is apparent from a method and an appara- 
tus according to the present invention for interleaving a 
bitstream to record the interleaved bitstream to a record- 
ing medium and reproduce the recorded bitstream 
therefrom is suitable for the application of an authoring 
system which can generate a new title by editing a title 
constructed by bitstreams carrying various information 
in accordance with the user's request, and is also suit- 
able for a Digital Video Disk System, or DVD System 
being developed recently. 



Claims 

1. A machine readable optical disc (M) containing a 
title having a first data stream and a second data 
5 stream, said disc comprising: 

said first data stream containing video data; 

said second data stream containing video 

10 data; and 

a management information storage area 
comprising management information, 
wherein the management information con- 
tains a flag (SPF) operable to indicate, 

15 when said optical disc (M) is read by a ma- 

chine, whether or not the second data 
stream shall be - during reproduction - pre- 
sented seamlessly afterpresentationofthe 
first data stream, such that the presenta- 

20 tion terminating time of the video data of 

the first data stream is equivatent to the 
presentation starting time of the video data 
of the second data stream. 

25 2. A reproduction apparatus for reproducing an optical 
disc (M) containing a title having a first data stream 
and a second data stream, said disc (M) containing 
said first data stream containing video data, said 
second data stream containing video data, and a 

30 management information storage area comprising 
management information, wherein the manage- 
ment information contains a flag (SPF) operable to 
indicate whether or not the second data stream 
shall be presented seamlessly after presentation of 

35 the first data stream, said reproduction apparatus 
comprising: 

a reading arrangement operable to read the 
management information and the first and sec- 

40 ond data streams from the optical disc; and 

a determining arrangement operable to deter- 
mine, based on the flag (SPF) whether or not 
to present the second data stream seamlessly 
after presentation of the first data stream such 

45 that the presentation terminating time of the 

video data of the first data stream is.equivalent 
to the presentation starting time of the video da- 
ta of the second data stream. 

50 3. A reproduction method for reproducing an optical 
disc containing a title having a first data stream and 
a second data stream, said disc containing, said 
first data stream containing video data, said second 
data stream containing video data, and a manage- 

55 ment information storage area comprising manage- 
ment information, wherein the management infor- 
mation contains a flag (SPF) operable to indicate 
whether or not the second data stream shall be pre- 
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sented seamlessly after presentation of the first da- 
ta stream, said reproduction method comprising: 

reading the management information and the 
first and second data streams from the optical 5 
disc; and 

determining based on the flag (SPF), whether 
or not to present the second data stream seam- 
lessly after presentation of the first data stream 
such that the presentation terminating time of 10 
the video data of the first data stream is equiv- 
alent to the presentation starting time of the vid- 
eo data of the second data stream. 

15 

Patentanspruche 

1 . Maschinenlesbare optische Disk (M) mit einem Tltel 
mit einem ersten Datenstrom und einem zweiten 
Datenstrom, wobei die Disk umfasst: 20 

den ersten Datenstrom, der Videodaten ent- 
halt; 

den zweiten Datenstrom, der Videodaten ent- 
halt; und einen Verwaitungsinformations-Spei- 25 
cherbereich, mit Verwaltungsinfomnation, wo- 
bei die Verwaltungsinformation ein Flag (SPF) 
enthalt, welches anzeigt, ob der zweite Daten- 
strom, wenn die optische Disk (M) von einer 
Maschine gelesen wird - wahrend der Wieder- 30 
gabe - nahtlos nach der Darstellung des ersten 
Datenstroms dargestellt werden soil oder nicht, 
so dass die Darstellungs-Endzeit der Videoda- 
ten des ersten Datenstroms der Darsteilungs- 
Anfangszeit der Videodaten des zweiten Da- 35 
tenstroms entspricht. 

2. Wiedergabevorrichtung zum Wiedergeben einer 
optischen Disk (M) mit einem Titel mit einem ersten 
Datenstrom und einem zweiten Datenstrom, wobei 40 
die Disk (M) den ersten Datenstrom enthalt, der Vi- 
deodaten enthalt, den zweiten Datenstrom, der Vi- 
deodaten enthalt, und einen Verwaltungsinformati- 
ons-Speicherbereich, mit Verwaltungsinformation, 
wobei die Verwaltungsinformation ein Flag (SPF) 45 
enthalt, welches anzeigt, ob der zweite Datenstrom 
nahtlos nach der Darstellung des ersten Daten- 
stroms dargestellt werden soli oder nicht, wobei die 
Wiedergabevorrichtung umfasst: 

50 

eine Leseanordnung zum Lesen der Verwal- 
tungsinformation und des ersten und zweiten 
Datenstroms von der optischen Disk; und 
eine Bestimmungsanordnung zum Bestimmen, 
basierend auf dem Flag (SPF), ob der zweite 55 
Datenstrom nach der Darstellung des ersten 
Datenstroms nahtlos dargestellt werden soil 
oder nicht, so dass die Darstellungs-Endzeit 



der Videodaten des ersten Datenstroms der 
Darstellungs-Anfangszeit der Videodaten des 
zweiten Datenstroms entspricht. 

3. Wiedergabeverfahren zum Wiedergeben einer op- 
tischen Disk mit einem Titel, mit einem ersten Da- 
tenstrom und einem zweiten Datenstrom, wobei die 
Disk den ersten Datenstrom, der Videodaten ent- 
halt, den zweiten Datenstrom, der Videodaten ent- 
halt, und einen Verwaltungsinformations-Speicher- 
bereich enthalt, der Verwaltungsinformationen ent- 
halt, wobei die Verwaltungsinformation ein Flag 
(SPF) enthalt, welches anzeigt, ob der zweite Da- 
tenstrom nahtlos nach der Darstellung des ersten 
Datenstroms dargestellt werden soil oder nicht, wo- 
bei das Wiedergabeverfahren umfasst: 

Lesen der Verwaltungsinformation und des er- 
sten und zweiten Datenstroms von der opti- 
schen Disk; und 

Bestimmen, basierend auf dem Flag (SPF), ob 
der zweite Datenstrom nach der Darstellung 
des ersten Datenstroms nahtlos dargestellt 
werden soil oder nicht, so dass die Darstel- 
lungs-Endzeit der Videodaten des ersten Da- 
tenstroms der Darstellungs-Anfangszeit der Vi- 
deodaten des zweiten Datenstroms entspricht. 



Revendications 

1. Disque optique (M) lisible sur machine contenant 
un titre ayant un premier flux de donnees et un se- 
cond flux de donnees, ledit disque comprenant : 

ledit premier flux de donnees contenant des 
donnees video ; 

ledit second flux de donnees contenant des 
donnees video ; et 

une zone de stockage d'information de gestion 
comprenant de I'information de gestion, ou Tin- 
formation de gestion contient un moyen dedi- 
cation (SPF) operable pour indiquer, quand le- 
dit disque optique (M) est lu par une machine, 
si oui ou non le second flux de donnees peut 
etre - pendant reproduction - presente de ma- 
niere continue apres presentation du premier 
flux de donnees, tel que le temps de terminai- 
son de la presentation des donnees video du 
premier flux de donnees est equivalent au 
temps de depart de la presentation des don- 
nees video du second flux de donnees. 

2. Appareil de reproduction pour reproduire un disque 
optique (M) contenant un titre ayant un premier flux 
de donnees et un second flux de donnees, ledit dis- 
que (M) contenant ledit premier flux de donnees 
contenant des donnees video, ledit second flux de 
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donnees contenant des donnees video, et une zone 
de stockage d'information de gestion comprenant 
de rinformation de gestion, ou rinformation de ges- 
tion contient un moyen dedication (SPF) operable 
pour indiquer si oui ou non le second flux de don- 5 
nees peut etre presente de maniere continue apres 
presentation du premier flux de donnees, ledit ap- 
pareil de reproduction comprenant : 

un assemblage de lecture operable pour lire 10 
rinformation de gestion et les premier et se- 
cond flux de donnees a partir du disque 
optique ; et 

un assemblage de determination operable pour 
determiner, sur la base du moyen dedication *s 
(SPF), si oui ou non il faut presenter le second 
flux de donnees de maniere continue apres 
presentation du premierflux de donnees tel que 
le temps de terminaison de la presentation des 
donnees video du premierflux de donnees est 20 
equivalent au temps de depart de la presenta- 
tion des donnees video du second flux de don- 
nees. 

3. Procede de reproduction pour reproduire un disque 25 
optique contenant un titre ayant un premierflux de 
donnees et un second flux de donnees, ledit disque 
contenant ledit premierflux de donnees contenant 
des donnees video, ledit second flux de donnees 
contenant des donnees video, et une zone de stoc- 30 
kage d'information de gestion comprenant de rin- 
formation de gestion, ou rinformation de gestion 
contient un moyen dedication (SPF) operable pour 
indiquer si oui ou non le second flux de donnees 
peut etre presente de maniere continue apres pre- 35 
sentation du premier flux de donnees, ledit procede 
de reproduction comprenant : 

lire ('information de gestion et les premier et se- 
cond flux de donnees a partir du disque 40 
optique ; et 

determiner, sur la base du moyen dedication 
(SPF), si oui ou non il faut presenter le second 
flux de donnees de maniere continue apres 
presentation du premierflux de donnees tel que 45 
le temps de terminaison de la presentation des 
donnees video du premierflux de donnees est 
equivalent au temps de depart de la presenta- 
tion des donnees video du second flux de don- 
nees. 50 
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